LNCS3106 



I Kyung-Yong Chwa 
J. Ian Munro (Eds.) 

Computing 
and Combinatorics 

10th Annual International Conference, COCOON 2004 
Jeju Island, Korea, August 2004 
Proceedings 



^ Springer 




Lecture Notes in Computer Science 

Commenced Publication in 1973 
Founding and Former Series Editors: 

Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen 

Editorial Board 

David Hutchison 

Lancaster University, UK 
Takeo Kanade 

Carnegie Mellon University, Pittsburgh, PA, USA 
Josef Kittler 

University of Surrey, Guildford, UK 
Jon M. Kleinberg 

Cornell University, Ithaca, NY, USA 
Friedemann Mattern 

ETH Zurich, Switzerland 
John C. Mitchell 

Stanford University, CA, USA 
Moni Naor 

Weizmann Institute of Science, Rehovot, Israel 
Oscar Nierstrasz 

University of Bern, Switzerland 
C. Pandu Rangan 

Indian Institute of Technology, Madras, India 
Bernhard Steffen 

University of Dortmund, Germany 
Madhu Sudan 

Massachusetts Institute of Technology, MA, USA 
Demetri Terzopoulos 

New York University, NY, USA 
Doug Tygar 

University of California, Berkeley, CA, USA 
Moshe Y. Vardi 

Rice University, Houston, TX, USA 
Gerhard Weikum 

Max-Planck Institute of Computer Science, Saarbruecken, Germany 



3106 




Kyung-Yong Chwa J. Ian Munro (Eds.) 

Computing 
and Combinatorics 



10th Annual International Conference, COCOON 2004 
Jeju Island, Korea, August 17-20, 2004 
Proceedings 




Springer 




Volume Editors 



Kyung-Yong Chwa 

Korea Advanced Institute of Science and Technology 
Department of Electrical Engineering & Computer Science 
373-1 Guseong-dong Yuseong-gu, Daejeon 305-701, Republic of Korea 
E-mail: kychwa@jupiter.kaist.ac.kr 

J. Ian Munro 

University of Waterloo, School of Computer Science 

200 University Avenue West, Waterloo, ON, Canada N2L 3G1 

E-mail: imunro@uwaterloo.ca 



Library of Congress Control Number: Applied for 

CR Subject Classification (1998): E.2, G.2.1-2, 1.3.5, C.2.3-4, E.l, E.5, E.4 
ISSN 0302-9743 

ISBN 3-540-22856-X Springer Berlin Heidelberg New York 



This work is subject to copyright. All rights are reserved, whether the whole or part of the material is 
concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, 
reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication 
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, 
in its current version, and permission for use must always be obtained from Springer. Violations are liable 
to prosecution under the German Copyright Law. 

Springer is a part of Springer Science-l-Business Media 

springeronline.com 

© Springer-Verlag Berlin Heidelberg 2004 
Printed in Germany 

Typesetting: Camera-ready by author, data conversion by Olgun Computergrafik 
Printed on acid-free paper SPIN: 1 1304272 06/3142 5 4 3 2 1 0 




Preface 



The papers in this volume were selected for presentation at the 10th International 
Computing and Combinatorics Conference (COCOON 2004), held on August 
17-20, 2004 in Jeju Island, Korea. Previous meetings were held in Xi’an (1995), 
Hong Kong (1996), Shanghai (1997), Taipei (1998), Tokyo (1999), Sydney (2000), 
Guilin (2001), Singapore (2002), and Big Sky (2003). 

In response to the call for papers, 109 extended abstracts were submitted 
from 23 countries, of which 46 were accepted. The submitted papers were from 
Belgium (1), Canada (5), China (6), France (1), Germany (6), Hong Kong (8), 
India (6), Iran (1), Ireland (1), Israel (4), Italy (2), Japan (17), Korea (23), 
Mexico (3), New Zealand (1), Poland (1), Russia (1), Singapore (5), Sweden (2), 
Switzerland (3), Taiwan (2), the UK (1), and the USA (9). 

Each paper was evaluated by at least three program committee members, 
with the assistance of referees, as indicated by the referee list found in these 
proceedings. There were many more acceptable papers than there was space 
available in the conference schedule, and the program committee’s task was 
extremely difficult. In addition to selected papers, the conference also included 
three invited presentations by Lars Arge, Jeong Han Kim, and Kokichi Sugihara. 

We thank all program committee members and their referees for their ex- 
cellent work, especially given the demanding time constraints; they gave the 
conference its distinctive character. We thank all who submitted papers for con- 
sideration: they all contributed to the high quality of the conference. 

Finally, we thank all the people who worked hard to put in place the logistical 
arrangements of the conference — our colleagues and our graduate students. It 
is their hard work that made the conference possible and enjoyable. 
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Kyung-Yong Chwa 
J. Ian Munro 
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External Geometric Data Structures 

(Invited Paper) 

Lars Arge* 

Department of Computer Science, Duke University, Durham, NC 27708, USA 

large@cs . duke . edu 

Many modern applications store and process datasets much larger than the main 
memory of even state-of-the-art high-end machines. Thus massive and dynami- 
cally changing datasets often need to be stored in space efficient data structures 
on external storage devices such as disks, and in such cases the Input/Output 
(or I/O) communication between internal and external memory can become a 
major performance bottleneck. Many massive dataset applications involve geo- 
metric data (for example, points, lines, and polygons) or data that can be inter- 
preted geometrically. Such applications often perform queries that correspond 
to searching in massive multidimensional geometric databases for objects that 
satisfy certain spatial constraints. Typical queries include reporting the objects 
intersecting a query region, reporting the objects containing a query point, and 
reporting objects near a query point. 

While development of practically efficient (and ideally also multi-purpose) 
external memory data structures (indexes) has always been a main concern in 
the database community, most data structure research in the algorithms com- 
munity has focused on worst-case efficient internal memory data structures. Re- 
cently however, there has been some cross- fertilization between the two areas. 
In this talk we discuss some of the recent advances in the development of dy- 
namic and worst-case efficient external memory geometric data structures. We 
focus on fundamental structures for important one- and two-dimensional range 
searching related problems, and try to highlight some of the fundamental tech- 
niques used to develop such structures. More comprehensive surveys of external 
data structure results, as well as external memory algorithms, can e.g. be found 
in [1-3]. 

References 

1. L. Arge. External memory data structures. In J. Abello, P. M. Pardalos, and 
M. G. C. Resende, editors. Handbook of Massive Data Sets, pages 313-358. Kluwer 
Academic Publishers, 2002. 

2. L. Arge. External-memory geometric data structures. In G. S. Brodal and R. Fager- 
berg, editors, EEF Summer School on Massive Datasets. Springer Verlag, 2004. To 
appear. 

3. J. S. Vitter. External memory algorithms and data structures: Dealing with MAS- 
SIVE data. ACM Computing Surveys, 33(2):209-271, 2001. 
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9972879, ITR grant EIA-0112849, GAREER grant GGR-9984099, and U.S.- 
Germany Gooperative Research Program grant INT-0129182. 
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The Poisson Cloning Model 
for Random Graphs, Random Directed Graphs 
and Random fc-SAT Problems 



Jeong Han Kim 
Microsoft Research 

One Microsoft Way, Redmond, WA 98052, USA 
jehkimSmicrosoft . com 

In the (classical) random graph model G{n,p) with p = 0(l/n), the degrees of 
the vertices are almost i.i.d Poisson random variables with mean d = pn+0{l/n). 
Though this fact is useful to understand the nature of the model, it has not been 
possible to fully utilize properties of i.i.d Poisson random variables. For example, 
the distribution of the number of isolated vertices is very close to the binomial 
distribution B{n, de~‘^). In a rigorous proof, however, one has to keep saying how 
close the distribution is and tracking the effect of the small difference, which is 
not necessary if the degrees are exactly i.i.d Poisson. Since these kinds of small 
differences occur almost everywhere in the analysis of the random graph, they 
make rigorous analysis significantly difficult, if not impossible. 

As an approach to minimize such non-essential parts of the analysis, we 
introduce a random graph model, called the Poisson cloning model, in which all 
degrees are i.i.d Poisson random variables. Similar models may be introduced 
for random directed graphs and random fc-SAT problems. We will first establish 
theorems saying that the new models are essentially equivalent to the classical 
models. To demonstrate how useful the new models are, we will completely 
analyze some of well-known problems such as the fc-core problems of the random 
graph, the strong component problem of the random directed graph and/or the 
pure literal algorithm of the random fc-SAT problem. 

This lecture will be self-contained, especially no prior knowledge about the 
above mentioned problems are required. 
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Robust Geometric Computation 
Based on Digital Topology 



Kokichi Sugihara 



Department of Mathematical Informatics 
University of Tokyo 

7-3-1 Kongo, Bunkyo-ku, Tokyo 113-8656, Japan 
sugiharaOmist . i .u-tokyo .ac.jp 



Abstract. This paper presents a new scheme for designing numeri- 
cally robust geometric algorithms based on topological consistency. The 
topology-based approach is one of the most successful principle for de- 
signing robust algorithms to solve geometric problems, which the au- 
thor’s group has been developed for a long time. This approach gen- 
erates efficient algorithms because the floating-point arithmetic can be 
used, but is not a beginners’ technique because topological invariants for 
individual problems are required in the design of the algorithms. The 
new scheme presented here uses digital topology instead of individual 
invariants, and hence will change the topology-based approach from a 
middle-level technique to a beginners’ level technique. The basic idea 
together with its application to wire bundling is presented. 



1 Introduction 

Naive implementation of geometric algorithms usually generates unrobust soft- 
ware even though the algorithms are translated into computer language cor- 
rectly. This is mainly because the algorithms are designed on the assumption 
that numerical computation can be done correctly, while actual computers gen- 
erate numerical errors in computation. Numerical errors cause misjudgement on 
topological structures, generate inconsistency, and thus make software to fail. 

To overcome this difficulty, many approaches have been proposed. We can 
classify them into three groups: the exact computation approach [13,18], the 
topology-based approach [12,15], and the others [1,7]. 

In exact computation approach, the resolution of the input data is restricted, 
and computation is done in precision that is high enough to judge the topological 
structure always correctly [18]. 

This approach requires high cost in computation, but general because the 
same strategy can be applied to a wide class of geometric problems. In this 
sense, we may say that this approach is suitable to beginners. 

In the topology-based approach, on the other hand, we do not expect any 
accuracy in numerical computation; we place higher priority on keeping the con- 
sistency of the topological consistency, and use numerical information only when 
it is consistent with the topological structure [16]. In this approach, we can use 
floating-point arithmetic, and hence the resulting computer program runs fast. 
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However, this approach requires a certain insight to individual problems, because 
we have to extract topologically invariant properties on which the robustness re- 
lies. Hence, we may say that this approach is a middle-level technique. 

These two approaches are extremes in mutually opposite directions; the first 
approach uses perfectly reliable computation, while the second approach does 
not rely on numerical computation at all. 

Other approaches to robustness can be classified into the third group in 
the sense that they are placed between the first two extreme approaches [1, 
7]. In these approaches, they assume a certain fixed precision in computation. 
In each computation, they evaluate the error, and judge whether the result is 
reliable. If reliable, they use it; otherwise they do something else. In this way, 
the resulting software has a binary branch of processing at every computation. 
Moreover, how to construct the branches much depends on individual problems 
to be solved. Therefore, we may say that these approaches are the professional- 
level techniques. 

In this paper, we concentrate on the second approach, the topology-based 
approach, and propose a new scheme for changing it from the middle-level tech- 
nique to a beginners’ technique. The basic idea is the use of digital picture 
approximation. We first solve the problem approximately on a digital picture, 
next extract the topological structure from the digital picture, and finally use 
this topological structure in the topology-based approach. Because we can ex- 
tract a consistent topological structure from the digital picture approximation, 
the most difficult step in the original topology-based approach is replaced with 
an almost automatic step, which is easily carried out. 

The original topology-based approach is reviewed in Section 2, and the new 
scheme is presented in Section 3. An example is given in Section 4, and the 
conclusion in Section 5. 



2 Topology-Based Approach 

Suppose that we are given an algorithm for solving a geometric problem such as 
the construction of a convex hull and the construction of a Voronoi diagram. Let 
us assume that numerical computation contains errors. A naive implementation 
of the algorithm usually generates unstable software which fails easily. 

Fig. 1(a) shows how a conventional algorithm fails. Let S = {Ji, J 2 , • • • , Jn} 
be the set of all the predicates that should be checked in the algorithm. Whether 
those predicates are true or not is judged on the basis of numerical computations. 
Since numerical computations contain errors, some of the predicates may be 
judged incorrectly, which in turn generate inconsistency and the algorithm fails. 

Numerical errors cannot be avoided in computation, but still we want to 
avoid inconsistency. To achieve this goal, the topology-based approach tries to 
change the structure of the algorithm as shown in Fig. 1(b). We first try to find 
a maximal subset, say S' , of predicates that are independent from each other, 
where “independent” means that the truth values of any predicates in S' do not 
affect the truth values of the other predicates in this subset. The other predicates 
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numerical computation 
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failure 



numerical computation 
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normal termination 



(b) 

Fig. 1. Basic idea of the topology-based approach. 



are dependent in the sense that their truth values are determined as the logical 
consequence of the truth values of the predicates in S' . 

Once we succeed in dividing the predicate set S into a maximal independent 
subset S' and the other set, we can avoid inconsistency in the following manner. 
We decide the truth values of the predicates in S' by numerical computation, 
and adopt the logical consequences of them as the truth values of the other 
predicates, i.e., the predicates in S' — S'. Since the predicates in S' are indepen- 
dent, any assignment of the truth values to these predicates does not generate 
inconsistency. Moreover, since we adopt the logical consequences of these truth 
values as the truth values of the predicates in S — S', inconsistency never arises. 

Note that we cannot guarantee the correctness of the truth values in S' 
because we have numerical errors. However, once we believe the result of the 
numerical computation, we can construct a consistent world. This is the basic 
idea for avoiding inconsistency in the topology-based approach. 

The next problem is how to find the maximal independent subset S' of the 
predicates. This is the most difficult part, and much depends on individual algo- 
rithms. We extract purely topological properties of geometric objects, and judge 
a predicate independent if the topological properties are not violated no mat- 
ter whether the predicate is true or false. This is the reason why we call this 
approach the “topology-based approach” . 

The following are examples of typical topological properties. In the incre- 
mental algorithm for constructing the Voronoi diagram for points in the plane, 
“the substructure to be deleted by the addition of a new generator should be a 
tree in the graph theoretic sense” [14, 15]. In the algorithm for cutting a convex 
polyhedron by a plane, “both of the partitioned vertex-edge graphs of the poly- 
hedron should be connected” [11]. In the incremental algorithm for the Voronoi 
diagram for line-segments, “the subgraph to be deleted by the addition of a new 
line-segment should contain a path connecting two Voronoi regions that con- 
tain the terminal of the line-segment” [2,16]. Other examples can be found in 
the construction of three-dimensional Voronoi diagram [3], the construction of 
three-dimensional convex hulls [8], and the divide-and-conquer construction of 
two-dimensional Voronoi diagrams [10]. 
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Though there are many successful cases as stated above, it is still a bottle neck 
to extract purely topological properties for each algorithm. We will overcame this 
difficulty in the next section. 

3 Use of Digital Topology 

The most difficult step in the design of geometric algorithms according to the 
topology-based approach is to extract purely topological properties that the 
objects should satisfy. To circumvent this difficulty, we propose a new scheme, in 
which the topological structure is automatically extracted from a digital-picture 
approximation. The basic idea is as follows. 

As shown in Fig. 2, we first solve the given problem approximately by the 
digital-picture technique. In the two-dimensional space, the digital picture is an 
array of small squares called pixels. In the three-dimensional space, on the other 
hand, the digital picture is an array of small cubes called boxels. In both cases 
the space is filled with regular elements, and each element is assigned specific 
attributes to represent a geometric object approximately. 



Geometric problem 




Topologically consistent solution 



Fig. 2. Geometric computation based on digital topology. 



For example, if we want to construct the Voronoi diagram in the plane, 
we assign to each pixel a specific color corresponding to the nearest generator. 
This representation is very flexible in the sense that any generalized Voronoi 
diagram can be constructed in the same manner once we are given a generalized 
distance between pixels and generators. Indeed for some types of generalized 
Voronoi diagrams such as the crystal growth Voronoi diagram [6] and the boat- 
sail Voronoi diagram [9], the digital-picture approximation seems to be the only 
way for constructing the Voronoi diagram in reasonable time. 
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Historically, the digital picture approach has not been considered so seriously 
in computational geometry. This may be mainly because this is a kind of brute- 
force method, and gives only an approximation of the true resolution; if we want 
a sufficiently good approximation, we need very high resolution of the pixel/boxel 
array and hence we need much time in computation. 

For our present purpose, however, high resolution is not necessary. What 
we need is an approximation of the topological structure, and hence only a 
low resolution approximation is sufficient. Hence, this approximation can be 
constructed efficiently. 

In the second step, we extract the topological structure of the object from the 
digital picture approximation. For example, if the object is the Voronoi diagram 
in the plane, we extract the graph structure consisting of Voronoi vertices and 
Voronoi edges. This can be done automatically, because the Voronoi vertices are 
points at which three regions meet, and the Voronoi edges are lines at which two 
regions meet. 

By doing so, we can skip extracting invariant topological properties. The 
space-filling nature of the digital picture automatically tells us a specific example 
of the topological structure that is consistent in the sense that this topology is 
actually realized by the example of geometric object represented as the array of 
pixels or boxels. Therefore, no matter how sophisticatedly the Voronoi diagram 
is generalized, the common procedure can be applied to get the topological 
structures. 

Once we get the topological structure, we believe this is the correct topologi- 
cal structure just as we do in the topology-based approach. In the third step, we 
compute the geometric attributes of the obtained topological structure such as 
the coordinates of the Voronoi vertices and the shapes of the Voronoi edges, and 
thus generate the final output of the algorithm. Note that this output can be 
considered as an approximation of the correct solution, and at least the topolog- 
ical consistency is guaranteed because this topology is actually realized by the 
digital-picture approximation. 

Recall that a fatal error of geometric computation comes from topological 
inconsistency of a geometric object. When we deal with a digital picture, the 
space- filling structure of pixels/boxels has fixed at the beginning and is never 
changed. Hence, the topological consistency is kept very easily by representing 
the geometric object in terms of the digital picture. This way we can avoid incon- 
sistency almost automatically. Thus we can change the topology-based approach 
from a middle-level technique to a beginners’ level technique. 



4 Example: Circle Voronoi Diagram 
and Its Application to Wire Bundling 

In the design of modern industrial products such as planes and cars, we often 
want to estimate the size of a wire bundle for a given set of categories and 
numbers of electric wires. Fig. 3 shows a section of such a wire bundle. 
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Fig. 3. Example of a wire bundle. 



A simplest way to treat this problem is to reduce it to a two-dimensional 
disk packing problem: given a set of disks, we want to find the smallest enclosing 
circle containing all the disks without overlap. To solve this problem strictly is 
very hard; so we have to be satisfied with a nearly smallest circle. 

One successful method for this problem is a “shake-and-shrink” algorithm 
[17]. In this algorithm, we start with a sufficiently large enclosing circle, and 
simulate the behavior of the disks when we keep shaking the disks and shrink 
the enclosing circle step by step. In this algorithm a crucial step is to find empty 
spaces at which a new disk can be placed without intersecting other disks. This 
step is repeated a large number of times in the simulation, and the circle Voronoi 
diagram can reduce the computational time drastically. 

Let E be the region bounded by the enclosing circle, and dE be its boundary. 
Let D = {di,d 2 ,- ■ ■ , dn} be a set of n disks placed without overlap in E. For any 
point set X in the plane, let d(p,X) denote the Euclidean distance from point 
p to X. We define the Voronoi region of c?i {i = 1, 2, • • • , n) as 

R{d^) = {p£ E \ d{p,di) < d{p,dj) for j ^ i, d{p,pi) < d{p,dE)}, 

and the Voronoi region of dE as 

R{dE) = {p £ E \ d{p,dE) < d{p,di) for any i}. 

The region E is partitioned into the Voronoi regions R{di) , R{d2) , ■ ■ ■ , R{dn) , 
R{dE) and their boundaries. We call this partition the circle Voronoi diagram 
for the disk set D and the enclosing circle dE, and denote it by V{D,dE). 
Curves shared by the boundaries of two regions are called Voronoi edges, and 
points shared by the boundaries of three or more regions are called Voronoi 
vertices. Fig. 4 shows an example of the circle Voronoi diagram. 

The circle Voronoi diagram contains curved Voronoi edges, and hence is not 
so easy to compute as the ordinary Voronoi diagram for points. For example, see 
[4, 5] for an algorithm and implementation. 
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Fig. 4. Circle Voronoi diagram in a circle. 



Suppose that we have n nonoverlapping disks di,d 2 , - ■ ■ ,dn and the enclosing 
circle dE, and we want to find all possible placement of the (n+ l)-th disk d„+i 
touching two other disks or the ecnlosing circle dE. If we search for them in a 
naive manner, it takes O(n^) time because there are O(n^) placements of d„_i 
touching two of the other disks, and for each placement 0(n) additional time is 
required to check whether it intersects other disks. 

Using the circle Voronoi diagram, we can decrease the time complexity dras- 
tically. 

If the disk dn+i touches two other disks, say di and dj, then the center of 
dn+i is in equal distance from di and dj, which means that the center of d„+i is 
on the Voronoi edge shared by R{di) and R{dj) or on its extension. The center 
of dn+i being on the Voronoi edge also means that does not intersect any 
other disks, because the Voronoi edge consists of points that are nearer to di 
and dj than to any other disks. 

For any point on a Voronoi edge, let us define the clearance as the distance to 
the two side disks. Let r„+i be the radius of dn+i - The disk d„+i can be placed 
on a Voronoi edge without intersecting other edges if and only if the clearance 
at the center of dn+i is equal to or greater than r„+i. Note that the clearance at 
a point on a Voronoi edge gives its maximal value at one of the terminal points 
of the edge. Because of these properties, we can find all possible placements of 
dn+i in the following manner. 

Suppose that we already constructed the circle Voronoi diagram V{D,dE). 
Also suppose that for each Voronoi vertex we assigned the clearance. We first 
select only those Voronoi vertices whose clearance is equal to or greater than 
There are only 0(n) vertices, and hence the number of chosen vertices is 
also 0(n) at most. Next, for each of those vertices, we check the incident edges 
to find the center of dn+i on the edge such that it touches two other disks. 
Since the total number of Voronoi edges is also bounded by 0(n), we can find 
all possible placements of dn+i in 0(n) time. 
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In this way, the time complexity of finding all possible placements of dn+i 
touching two other disks and avoiding the interference with other disks can be 
reduced from O(n^) to 0(n), if we have the circle Voronoi diagram. In this 
argument, we neglect the time necessary for constructing the Voronoi diagram, 
but this is reasonable. Indeed, the shake-and-shrink algorithm uses many circle 
Voronoi diagrams that differs from each other only slightly, and hence the average 
time complexity for generating one circle Voronoi diagram is negligible. 

For this application, we essentially need the Voronoi vertices and their clear- 
ances. Hence, the proposed method can be used effectively. Fig. 5 shows an 
example. Fig. 5(a) represents the disks and the enclosing circle. Fig. 5(b) shows 
the digital picture representing an approximation of V {D, E). In this digital pic- 
ture we chose the lowest resolution provided that the Voronoi region of each disk 
contains at least one pixel, and hence we could construct it very quickly. 

From this digital picture, we extracted the topological structure, i.e., the 
Voronoi vertices and their adjacency relations. Finally, we computed the clear- 
ances of the Voronoi vertices, as in Fig. 5(c), where the clearances are shown by 




(a) 





(c) 



Fig. 5. Digital picture approximation, and the computed vertices and clearances. 
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locally maximal empty circles represented by broken curves. Using these clear- 
ance information, we can find all possible placements of dn+i efficiently. 

Note that the extracted topological structure is just an approximation, and 
hence the clearance value may not represent the distance to the nearest disk; 
there might be other disks that are nearer. So we have to check also other disks 
in order to guarantee the non-intersection. However, we found experimentally 
that it is enough to check the disks whose regions are incident to the Voronoi 
vertices that are in two or three edge distant from the vertex in consideration. 
Hence, this intersection check can still be done in constant time. 

Using the circle Voronoi diagrams, we could reduce the computation cost of 
the shake-and-shrink algorithm to a large extent. For example, for a set of 169 
wires, the computation time reduced to less than 1% of the time required by the 
naive method. 



5 Concluding Remarks 

We presented a new scheme for circumventing the most difficult step of the 
topology-based approach; the basic idea is to generate a digital-picture approx- 
imation of a geometric object and to use its topological structure to guarantee 
the consistency. The space-filling nature of the digital-picture structure makes 
it easy for us to attain consistency. Therefore, we can automatically extract the 
topological structure, and thus can avoid the hard step of finding topologically 
invariant properties of individual problems and algorithms. In this way, we can 
change the topology-based approach from a middle-level technique to a begin- 
ners’ level technique. 
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Adjacency of Optimal Regions 
for Huffman Trees 
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Abstract. The Huffman tree is a binary tree storing each leaf with a key 
and its weight with the smallest weighted search time (i.e., the weighted 
sum of external path lengths). 

The combinatorial structure of the Huffman tree depends on the weights 
of keys, and thus the space of weights is tessellated into regions called op- 
timal regions associated with the combinatorial structures. In this paper 
we investigate properties of this tessellation. 

We show that each optimal region is convex and non-empty. Moreover, we 
give a combinatorial necessary and sufficient condition for the adjacency 
of optimal regions. We also give analogous results for alphabetic trees. 



1 Introduction 



Huffman tree is a fundamental and popular data structure, and has many ap- 
plications such as efficient coding of alphabets [2, pp. 402-406]. 

We consider a binary tree T with n leaves storing keys labeled by 1,2, ..,n. 
Each key i has a weight Wi > 0. We call w = {w\,W 2 , ■ ■ ■ , Wn} the weight vector 
or the weight of T. The level vector of T is defined as I = (?i, • • • , In), where 
li is the level (i.e. depth) of the leaf labeled by i. The cost of T is defined by 
The tree with the minimum cost is called the Huffman tree for the 
set of keys. The tree with the minimum cost under the condition that key i is 
stored in the t-th leaf from the left for each i = l,2,...,nis called the alphabetic 
tree. Huffman trees and alphabetic trees are extended binary trees (i.e. trees in 
which each internal node has two children). 

Given a vector I = {l\, - ■ ■ ,ln) of natural numbers, there exists a binary 
tree T{1), corresponding to this vector if 2“^* < 1, and the equality holds 
if the tree is an extended binary tree. The vector I is invariant under some 
permutations of the leaves, and we identify the trees with the same level vector 
in this paper. 

The set £„ of all n-dimensional level vectors for extended binary trees is 
defined as 



•Cn — \ I G li 



++ 



E2-'^ = i 
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where Z"_,_ := {(a;i, . . . , Xn) \xi £ Z,Xi > 0}. For a given weight ii; > 0, a level 
vector I is optimal if I ■ w < V ■ w for any other V £ Cn- Naturally, I is optimal 
if and only if its corresponding tree is a Huffman tree. 

For each I optimal region R{1) is defined as: 

R{1) = {w\l-w<V ■w,\a' £Cr,}. 

Let Wn '■= {w = {wi, . . . , Wn) \ wi £R,Wi > 0} be the set of all weight vectors, 
and it called weight space. The set of optimal regions for level vectors in £„ gives 
a subdivision of Wn (see Figure l(left)). 




(right) 



We investigate properties of the optimal regions. Two level vectors I and I' 
are said to be similar if there exists a weight w such that I ■ w = V ■ w < 
V ■ w, \/l" £ Cn- Two level vectors I and I' are said to be adjacent to each other 
if there exists a weight w such that I ■ w = I' ■ w < I” ■ w , VZ" £ Cn- 

Clearly, adjacency implies similarity, although its converse is not always true. 
Geometrically, similarity means that the corresponding optimal regions share a 
point on their boundary, whereas adjacency means that they share a facet. 

Regarding each level vector as a point in K”, we can reduce the computation 
of Huffman tree for a weight w to the following optimization problem: 

min w ■ 1. 

l&Cn 

The weight vector gives a descent direction of the level vector. In Figure l(right) 
when the direction is w' , the level vector (2, 1, 2) is optimal. Thus, in the space of 
level vectors, the adjacency of optimal regions can be interpreted as follows: two 
level vectors I and V are adjacent to each other if and only if they are connected 
by an edge on the convex hull of In Figure 1, optimal regions i?((2, 1,2)) 
and R{{2, 2, 1)) share a facet (left), while (2, 1, 2) and (2, 2, 1) are connected by 
an edge of convex hull of £3 (right) . 
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Consider a subset An of defined by 

The labels of leaves T(l) appear in ascending order 
from left to right if we traverse the tree in inorder. 

An is a set of all alphabetic trees with n leaves and the correspondence between 

1 and T{1) is one-to-one. For example, level vector (1, 2, 2) is in ^3, while (2, 1, 2) 
is not (see Figure l(left)). 

The rest of paper is organized as follows: in Section 2, we describe well-known 
result for Huffman tree. In Section 3, we give our main results: convexity and 
non-emptiness of an optimal region, and combinatorial conditions for adjacency. 

2 Preliminaries 

In [4, p.25] the following observation is given. 

Lemma A A weighted extended binary tree on a sequence of weights 
wi,W 2 , ■ • ■ , Wn is optimal only if the following conditions hold at any 
level 1: 

1) for any three nodes a, b, c in level I w{c) < w{a) + w{b); 

2) the weight of any node at level I is greater than or equal to the weight 
of any node at level I -I- 1. 

That is, we can check the optimality by computing maximum and two min- 
imum weights at every level. Further, these conditions are necessary condition. 
Hence, any weight satisfying these conditions is contained in the optimal region 
of the tree. 

3 Properties of Optimal Regions 

In this section we describe properties of optimal regions. We show that optimal 
region of Huffman or alphabetic tree is convex and non-empty. We also give a 
combinatorial necessary and sufficient condition for the adjacency. 



An — \l & £, 



3.1 Convexity and Non-emptiness 

First, we show that any optimal region is non-empty. For any I € Cn, let w(l) 
be the weight vector such that its z-th element is Wi = 2“*b We show that T{1) 
becomes the optimal tree for w{l). 

Consider the binary tree T{1). It is shown that the T{1) with w{l) satisfies 
the condition of optimality in Lemma A. In the lowest L-th level there are only 
leaves and their weights are equal to 2~^. In the next {L — l)-th level, internal 
nodes exist and their weights are the sum of the weights of their child nodes, i.e., 
2“^ -I- 2“^ = Leaves in the {L — l)-th level also have weight 

Thus, every node in the (L— l)-th level has weight Similarly, it is shown 

that every node has weight 2“* in the Fth level. Consequently, it is evident that 
the tree with the weight satisfies the condition 1) and 2) in Lemma A. 
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An is a proper subset of If a given vector I is contained in An and optimal 
for w{l) among then the tree is also optimal in An- 

Consider a weight w{l) +e where e := (si,-'’ An) and \si\ <C It is 
similarly shown that the level vector I is optimal for w{l) +e. Thus the optimal 
region has full dimension. 

Moreover, we state that the optimal region is convex. Since the optimal region 
is defined by intersection of halfspaces, the region is a convex cone. 

Theorem 1. The optimal region of any level vector is convex and has full di- 
mension. 



3.2 Similarity 

In this section we give a necessary and sufficient condition for the similarity. 

First, we show that there exists a weight such that both of two level vectors 
i, V are optimal for it if I — I € {—1, 0, 1}”. 

Lemma 1. Let I, V he two n-dimensional level vectors with I — V € {—1, 0, 1}”. 
Consider a weight ^w{l) + ^w{V). Then, the weight of any node in the l-th level 
ofT{l) andT{V) is in the range [§2“*, §2”^] • 

Proof. We use induction on the level of T(Z). Consider the lowest L-th level in the 
tree. At this level only leaves exist. The difference of leaf level between T{1) and 
T(Z') is at most one, the pair of levels is one of the following three types: (L, L— 1), 
(L,L), (L,L-|-1), therefore, their weights are either (2“^-|-2“i^+^))/2 = 

{2~^ + 2~^)/2 = 2~^ or {2~^ -k 2~^^-'^^)/2 = | • 2~^. Thus, all weights of 
\w{l) -\- \w{V) are included in [j2~^ , §2“^]. 

Assume that all weights are included in [|2“^ §2”*] at the Lth level. We 
show that weight of any node in the {I — l)-th level is in [| • | • . 

From the assumption, for any node a in Z-th level its weight w{a) satisfies the 
following inequality: | • 2“* < w(a) < | • 2~‘. Since weight of an internal node is 
given by the sum of the weights of the two children a and b. The weight of the 
internal node in the (Z — l)-th level is bounded by 

-•2-('-i) <rc(a)+u;(6) 

4 2 

Moreover, it is similarly shown that the weight of leaves in the (Z — l)-th level 
is also bounded. Consequently, all nodes in the (Z — l)-th level are included in 
[3 .2-d-i), |.2-d-i)]. □ 

From this lemma, we can check the conditions in Lemma A. Since the 
minimum weight in the Z-th level is j-2~^ and the maximum weight in the (Z-b 1)- 
th level is | • condition 2) is satisfied for any level. From this lemma, 

it is also shown that the condition 1) is satisfied. It is enough to show only the 
most extreme case. Since every weight at the Z-th level is included in [| • 2 ~^ , 
I • 2“*] , the extreme case is one that the smallest weight w{a) = w{b) = | • 2“* 
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and the largest weight w{c) = | • 2~K The inequality w{c) < w{a) + w{b) holds 
in the case. 

In the case of an alphabetic tree, the set An is a subset of £„ and the level 
vectors are optimal for \w{l) + ^w{V) in Then I and V are optimal for 
^w{l) + \w{V) in An- 

From the discussion above, we show the following lemma. 

Lemma 2. Let I, V be level vectors such that l — V G {—1, 0, 1}”. Then, extended 
binary trees T{1) and T{V) are optimal for the weight \w{l) + ^w{l'). 

Similarly, we can show that there exists a weight vector such that two or 
more level vectors are optimal. 

Lemma 3. Consider m level vectors , . . . ,1^) {k = 1, . . . ,m) such that 

pi _ p '2 g { — 1, 0, 1}” for any ji and j 2 - Let w{C , . . . , I™) be a weight whose i-th 
component is defined by 

_ ( 2“^* {when l\ = qt for any j) 

“ 1 i (2-«^ + 2-(9^+i)) {when t = Qi and If = ft + 1) ’ 

where qi is a positive integer. Any is optimal for the weight w{l^ , . . . , Z™) 

Proof. For each pair iL and , the difference of leaf level is at most one. The 
values of l\s can be one of two cases: 1) all have the same value; 2) the values 
of fs are divided into two integers qi and + 1. It is shown that T{P)s with 
w{P , . . . ,1™) are optimal by similar discussion as above. □ 

Next, we show the converse of lemma 2. 

Lemma 4. If level vectors I and V are similar, then the following relation holds: 

l-T G {-1,0,1}". 



Proof. Fix a level vector I and its Huffman tree T{1), whose optimal region is 
denoted by R{1). 

Assume that R{1) is expressed only by inequalities (Z — l')w < 0 where Z' 
satisfies Z — Z' G (—1, 0, 1}”. Then, for any weight w on the boundary of R{1), 
the following relations hold: (Z — V)w = 0, Z — Z' G (—1, 0, 1}”. 

Since Z and V are similar, there exists a weight w such that w is on the 
boundary of R{1) and of i?(Z'), and T{1) and T(Z') are optimal for the weight 
w. Because ii; is a point on the boundary of the optimal regions, Z' satisfies 
Z-Z' G {-1,0,1}”. 

In the rest of this proof, we show that for any Z R{1) is expressed only by 
inequalities (Z — V)w < 0 (Z — Z' G {—1,0, 1}”). 

Consider the conditions in Lemma A, which are necessary conditions for a 
Huffman tree: 

1. for any three nodes a, b, c in the Z-th level, w{c) < w{a) + w{b); 

2. for any node x in the Z-th level and y in the (Z -I- l)-th level, w{x) > w{y). 




18 



Kensuke Onishi 



We show that from each condition above a level vector I' ^ I satisfying I — V & 
{ — 1,0, 1}” is computed. 

Let L{a) be the set of leaves of the subtree with root node a. The weight of 
the internal node a is the sum of the weights of leaves in L(a). 

Since L{a), L{b) and L(c) are disjoint, the first condition becomes 

Wi< ^ Wi + ^ Wj. (1) 

i£L{c) i£L{a) i^L{b) 

The inequality (1) is expressed as tu • d < 0, where each element of the vector 
d is 0 or ±1. Without loss of generality, we assume nodes a and b are siblings 
in T{1). Let T' be an extended binary tree which is obtained by exchanging the 
subtree with root node c and the subtree with root node which is the parent 
of a and b in T{1) (see Figure 2). This tree T' is different from T{1). Let V be 
the level vector of T'. From inequality (1), we have (Z — l')w < 0, and the level 
vector V satisfies I — V G (—1, 0, 1}”. 




Fig. 2. Huffman tree T(Z) and T' 



Fig. 3. Huffman tree T{1) and T” 



The second condition also gives an inequality: 

iGL(x) iGL{y) 

When X is the parent of y, L{x) ^ L{y) holds, otherwise L{x) n L{y) = 0. Con- 
sider the case that x is the parent of y. The inequality (2) becomes '^i(zL(x)\L{y) 
Wi > 0. As every Wi is positive, this inequality gives a trivial bound. Consider 
another case. For the inequality (2), let T" be an extended binary tree which 
is obtained by exchanging the subtree with root x and the subtree with root 
y in T{1). Let I" be the level vector of T" (see Figure 3). Therefore, inequal- 
ity (2) is also transformed to U — l")w < 0 and the level vector I" satisfies 
l-V G {-1,0,1}". 

Let R'{1) be a polytope which is determined by these inequalities (1), (2). 
Since R' {1) is defined by inequalities for i?(Z), R' {1) contains R{1). On the other 
hands, the conditions in Lemma A are necessary conditions. That is, inequal- 
ities (1), (2) are also necessary conditions for optimality of T under w in W„ 
and hence the region R'{1) is subset of R{1). Thus, R{1) is equal to R'{1) as a 
polytope. The optimal region of I is defined only by those inequalities (1), (2) 
whose level vectors satisfy Z — Z'g{— 1,0,1}". 
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Consider alphabetic trees. The conditions of optimal regions of alphabetic 
trees are similar to those of Huffman trees. The rest of the discussion is analogous. 

□ 

Remark 1. For a level vector I there may exist several T(Z)s. Conversely, when 
a Huffman tree T is given, its level vector I is uniquely determined. 

Thus, we have the following characterization for the similarity. 

Theorem 2. Two level vectors I and V are similar if and only if I — V G 

{- 1 , 0 , 1 }". 

Consider all level vectors of alphabetic tree An- In this case, there is a well- 
known transformation from an alphabetic tree to another, called a rotation (see 
[3]). We show that two alphabetic trees are similar if the trees are transformed 
to each other by a single rotation. 

Corollary 1. Let I and V be level vectors in An- If alphabetic tree T{1) is trans- 
formed to the tree T{V) by a single rotation, then I and I' are similar - 

Proof Rotation is defined for a pair consisting of a node and its parent. Suppose 
the node is a right child of the parent. By the rotation the parent and its left 
subtree move one level down, and the node and its right subtree move one level 
up. Other parts are unchanged. Since the difference of level vectors is at most 
one, therefore, I and V are similar. □ 

Remark 2- For binary search tree, similar relation was already shown in [5] . 

3.3 Adjacency 

Two level vectors are similar if and only if the difference between the levels of 
each leaves is at most one. In this section we give a necessary and sufficient 
conditions that two level vectors are adjacent. 

First, we state non-adjacency condition. 

Lemma 5. Consider level vectors l,V - If there are two level vectors m,m' such 
that I V = m m' , then I and V are not adjacent- 

Proof- Consider a weight w such that I - w = V - w. 

Compute the inner product between I -\- V and the weight w. 

[I V) - w = {m m') • w 
21 - w = m - w -\- m' ■ w 

Since every element of the level vector I and the weight w are positive, I ■ w, 
m - w and m' - w are positive. Thus the following inequalities hold: 

I - w > m - w or I - w > m' - w. 

When this inequality holds, I is not optimal for w. When this equality holds, 
I - w = m - w = ml - w. In this case, I' ■ w has the same value as I ■ w. Thus 
the value of the inner product between the four level vectors and w is the same. 
Since m is also optimal, I and V are not adjacent to each other. □ 
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Fig. 4. Similar HufTman trees 



Figure 4 shows four Huffman trees. These Huffman trees are simultaneously 
optimal for w = (1/4,3/16,3/16,3/16,3/16). In this case the pairs {I, V), (m, 
m') are non-adjacent. The other pairs are adjacent. Geometrically, there is a 
hyperplane whose normal vector is w such that the hyperplane is supporting 
hyperplane of the convex hull of £„ and contains these level vectors. 

Next, we state the following lemma. 

Lemma 6. If level vectors 1,1' are non-adjacent, then the either of the following 
conditions holds: 

1. there exists an index i such that \li — l'f\ > 2; 

2. there exist two level vectors m and m! such that I 1' = m m! ; 
where li(l'J is the i-th element ofl(l'), respectively. 

Proof. Omit. □ 

Finally, we have a combinatorial necessary and sufficient condition for the 
adjacency from Lemma 5 and Lemma 6. 

Theorem 3. Two n-dimensional level vectors I and I' are adjacent if and only 
if the both of the following conditions holds: 

1. l-T G {-1,0,1}" and 

2. for any other level vectors m and m! , I 1' m m' . 

4 Conclusion 

In this paper we deal with properties of optimal regions for Huffman trees and 
alphabetic trees. These trees are expressed by level vectors. The optimal region 
of level vector is convex, non-empty and full-dimensionality. We characterize 
similarity and adjacency between two level vectors: two level vectors are similar 
if and only if the difference of the level vectors is contained in (—1, 0, 1}". Two 
level vectors of 1,1' are adjacent if and only if the two level vectors are similar 
and for any other level vectors m, m' it holds that Z -|- F yf m -|- m' . 

We state two applications of our results: one is computation of similar ex- 
tended binary trees for a given extended binary tree, another is enumeration of 
all extended binary tree by using similarity and reverse search. 
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In the former application, when an extended binary tree I = {h,--- ,ln) 
is given, similar trees are computed. The similar trees are useful for dynamic 
Huffman tree, or coding. Consider the case of Huffman tree. In the proof of 
Lemma 4, we state that one inequality corresponds to one similar level vector. 
For example, the similar vector I' = Jn) inequality (1) becomes as 

follows: 

n* + 1 (if f G L{c)) 

K = I h — I {ii i G L{a) U L{b)) . 

[ li (otherwise) 

Consider the case of alphabetic tree. Ramanan showed optimality condition for 
alphabetic tree in [6] . The condition for alphabetic tree is similar to that for Huff- 
man tree. Moreover, we state that one inequality of the condition corresponds 
one similar alphabetic tree. Hence, for given alphabetic tree, every similar alpha- 
betic trees can be computed from the inequality. For example, consider a level 
vector I = (1,3, 3, 2). In the tree T{1), node 2 and node 3 are the descendant of 
the sibling node of node 1, therefore, wi > W 2 + holds. In this case, node 1 
is down by one level, and node 2 and node 3 are up by one, then a similar level 
vector (2, 2, 2, 2) is computed. 

In the latter application, we consider enumeration of all extended binary trees 
by using reverse search [1]. Reverse search is a method of enumeration of nodes 
of a given graph by using parent relation. We only state how to use reverse search 
in this case. The node set of a graph is £„(or An). Two nodes are connected 
by an edge if the corresponding level vectors are similar. Lexicographic order is 
induced in £„. The parent relation is defined as follows: the parent of a given 
vector I is the smallest level vector among all similar level vector of I by the 
order. The enumeration starts from a node (1, 2, • • • , n — 1, n — 1) which is the 
smallest node in Some procedures are designed for using reverse search, for 
instance, computation of the parent and calculation of the next child node. These 
procedures can be programmed easily in this case. 

Moreover the case of alphabetic tree is equivalent to enumerating objects 
whose number is equal to Catalan number. Such objects have many different 
combinatorial interpretations. When considering a transformation from the level 
vector to other interpretations, the objects are also enumerated. One of the 
interesting interpretation is the triangulations of a convex polygon. In [7], it is 
shown that a rotation among binary trees is equal to a diagonal flip among the 
triangulations of convex polygon. The similarity of extended binary trees is a 
more useful relation than the rotation. Therefore, this relation would be used for 
optimization on extended binary trees, or on triangulations of a convex polygon. 

Finally, we describe the relation between the optimal region of Huffman tree 
and that of alphabetic tree for one level vector. Consider a level vector I in An. 
This vector is also included in £„. Let R{1) and S{1) be optimal regions in £„ 
and in An, respectively. Any point in £„ is optimal in An, but the converse is 
not true. Then the following relation holds: 



R{1) C S{1). 
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In information theory it is shown that the average length of a code using a 
Huffman tree becomes optimal and that using an alphabetic tree does not attain 
to optimal. This fact is consistent with above the relation above. 
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Abstract. We introduce a method for constructing optimally univer- 
sal hash families and equivalently RBIBDs. As a consequence of our 
construction we obtain minimal optimally universal hash families, if 
the cardinalities of the universe and the range are powers of the same 
prime. A corollary of this result is that the necessary condition for the 
existence of an RBIBD with parameters (v,k,X), namely u mod fc = 
X{v — 1) mod {k — 1) — 0, is sufficient, if v and k are powers of the 
same prime. As an application of our construction, we show that the 
fc-MAXCUT algorithm of Hofmeister and Lefmann [9] can be imple- 
mented such that it has a polynomial running time, in the case that the 
number of vertices and k are powers of the same prime. 



1 Introduction and Results 

The concept of universal hashing as introduced by Carter and Wegman [4] in 
1979 has found a wide variety of applications. Besides of being an important 
tool for hashing schemes (see e.g. [4,8,7]), universal hashing has been used in 
many other areas of computer science such as complexity theory, cryptography 
or algorithmics. The importance of this concept has led to a search for practical 
hash families with good properties and to the investigation of the combinato- 
rial structure of such hash families. In 1994, Stinson [15] has drawn the first 
connections between universal hash families and combinatorial designs such as 
resolvable balanced incomplete block designs or orthogonal arrays. Later on, 
more connections to combinatorial designs and other structures as, e.g., authen- 
tication codes were discovered. While on one hand, such connections have led to 
new applications of universal hashing in cryptography or the theory of combi- 
natorial designs, they also allowed new constructions of universal hash families 
and new ways of analyzing them (for some references see e.g. [16, 17, 1,3]). 

Definition 1. An (V;u,r)-hash family H is a family of N functions U ^ R, 
where U and R are finite sets of cardinality u > 1 and r > 1, resp. 

If is a hash family of functions U ^ R, then U is called universe and R is 
called range of Id. We call the elements from the universe keys^ and say that 
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two keys x\,X 2 G U collide under a function h € H, if h{x\) = h{x 2 .)- The 
collision probability of xi and X 2 is the probability that xi and X 2 collide under 
a randomly chosen function in H, i.e., Prob/ig-H (/i(xi) = h{x 2 ))- 

It was the original intention behind using hash families in hashing schemes 
to guarantee that under a randomly chosen hash function the expected number 
of collisions among a set of keys is low. This motivates the following common 
definition. 

Definition 2 ([4,15]). A hash family with universe U is e-universal, if any 
two distinct keys xi,X 2 G U have a collision probability of at most e. A (1/r)- 
universal (N;u,r) -hash family is simply called universal. 

Many constructions of e-universal hash families have been proposed, and some 
of them can be implemented by means of simple arithmetic operations such as 
multiplication [6] or convolution [11]. But besides the search for efficient hash 
families, there has also been some interest in investigating the properties of hash 
families with extremely small cardinalities or collision probabilities. 

We denote by gcd and Icm the greatest common divisor and the least com- 
mon multiple, resp. It can be shown [13] that no e-universal (TV; m, r)-hash fam- 
ily exists, if e < (u — r)/(r{u — 1)), and that for any e-universal (fV; m, r)-hash 
family with e = (u — r)/{r{u — 1)), it is iV > (u — l)/gcd (u — 1, r — 1) . These 
properties justify the following definition. 

Definition 3 ([13]). An e-universal {N; u, r)-hash family H is called optimally 
universal or short OU, if e = eopt{u,r) := {u — r)/(r{u— 1)). If in addition 
N = {u — l)/gcd(u — 1, r — 1), then H is called minimal optimally universal. 

Sarwate [13] has presented constructions of minimal optimally universal 
(TV; g", g*”)-hash families for any prime power q in the cases m = 1 and m = n— 1 
(n > 1). He states though, that while several ad hoc constructions of small OU 
universal {N; g”, g'")-hash families can be obtained for many values of n and m, 
there is no general construction method known that produces minimal OU hash 
families for all n and m. The main purpose of this paper is to present such a 
general construction method, as summarized by the following theorem. 

Theorem 1. Let q be an arbitrary prime power. For any 1 < m < n there exists 
a minimal optimally universal {N;q'^,q'^) -hash family. 

The construction is based on the definition of a new type of hash families, called 
partitioned universal. Our proof does not only show the existence of the op- 
timally universal hash families, but describes in fact an efficient algorithm for 
constructing them. Moreover, the resulting hash functions can be evaluated fairly 
simple by means of finite field arithmetic. We show below, that our construction 
has algorithmic consequences as well as consequences for the existence of certain 
important combinatorial objects. 
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Definition 4. A balanced incomplete block design (short: BIBDJ is a pair 
(V,B), where V is a set of points and B is a set of k -subsets ofV called blocks, 
such that for any two distinct points p,p' G V there are exactly A blocks b G B 
where {p,p'} C b. The parameters k and A are called block size and intersection 
number, resp. If some blocks form a partition of V, then the set of these blocks 
is called parallel class. A BIBD is resolvable, if its blocks can be partitioned into 
parallel classes. A resolvable BIBD with v points, block-size k and intersection 
number A is denoted by RBIBDA[fc;f]. 

RBIBDs are well investigated combinatorial structures and a lot of research 
has been spent on finding RBIBDs with certain parameters or on disproving 
their existence (a broad overview on results can be found in the monographs 
[2,5]). BIBDs and RBIBDs have also some algorithmic applications. E.g., the 
construction of layouts for redundant disk arrays in [10, 14] is based on BIBDs 
and Hofmeister and Lefmann [9] show how to find a large fc-cut in a graph with 
n vertices using an RBIBDs [A:; n]. The authors complain, though, that although 
various algebraic construction methods for BIBDs are known in the literature, 
not a lot of attention has been paid to their exact running times. 

Besides of the algorithmic problem of constructing RBIBDs, it is an impor- 
tant open question of design theory, for which parameters RBIBDs do exist. It 
is well-known that the following two conditions are necessary for the existence 
of an RBIBDa[A:; u]: 

(Cl) u = 0 (mod A:) and (C2) A(u — 1) = 0 (modA: — 1). 

It is easy to see that the second condition is fulfilled if and only if A is a mul- 
tiple of Xjnin{k,v) := {k — l)/gcd(A: — 1, u — 1). Thus, Ai„in(A;,u) is the minimal 
intersection number an RBIBD over v points and with block size k can have. 

As Stinson [15] has shown, an OU hash family Tt of functions U ^ R de- 
scribes an RBIBD, by taking U as point set and each set h~^{y) with y G R and 
h G Ti diS & block. Clearly, for a fixed h, the blocks h~^{y) with y G R form a 
parallel class. Taking into account that in an OU hash family any pair of keys 
has the same collision probability [13], it is easy to see that this block design is 
in fact an RBIBD. This construction can also be reversed such that one obtains 
from any RBIBD an OU hash family. 

Theorem 2 ([15]). An optimally universal {N;u,r)-hash family exists if 
and only if there is an RBIBDa[A:;u] with v = u, k = u/r, and A = 
N{u — r)/{r{u — 1)). 

Plugging the minimal intersection number Amin into this theorem, it is easy to 
see that a minimal optimally universal (AV; u, r)-hash family exists if and only if 
there exists an RBIBDv[M/r; m] with A = Amin(u/uw). Using this equivalence, 
our construction method of minimal OU hash families from Theorem 1 implies 
the existence of an RBIBDv,„j„(fc,n) [A:; u] for any k = and v = where q 

is a prime power (and m < n). Note that if u yf 0, then (Cl) obviously implies 
k < V. Since in addition any A satisfying (C2) is a multiple of Amin(A;,u), for 
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any such A an RBIBDA[A:;f] can be obtained by taking multiple copies of an 
R'BIBDA^,„(fe,„)[fc; u]. 

Corollary 1. The necessary conditions (Cl) and (C2) for the existence of an 
RBIBDa[A:;w] are sufficient, if k and v are powers of the same prime. 

Due to the fact that the hash family from Theorem 1 can be constructed ef- 
ficiently, we obtain also an efficient algorithm for constructing these RBIBDs. 
Therefore, they may be useful for algorithmic problems. 



2 Application to fc-MAXCUT 

In [9], Hofmeister and Lefmann have presented a deterministic algorithm for 
computing a large fc-cut in a graph, using RBIBDs. Let G = (V, E) be an 
undirected graph whose edges are weighted by a function w : E ^ Nq. By w{G) 
we denote the sum of edge weights, i.e., w{G) = ^ balanced k-cut 

of G is a partition of the vertices V into k sets V\, . . . ,Vk of equal size. The cut 
size of a /c-cut {V\,. . . , Vk) is the sum of weights of all crossing edges, that is 
edges e = {u, w} with (v, w) G Vi x Vj with i ^ j. 

Assume that an RBIBDA[n;fc] is given by a description of the blocks, and 
that the list of these blocks is already arranged in such a way, that all blocks 
of each parallel class appear one after the other. Hofmeister and Lefmann have 
shown that in this case a balanced /c-cut of size at least w{G) ■ • (1 -I- ;^) 

can be computed for any graph G with n vertices in time 0{k{n + m) + An^). 
Note that for any constant 0 < e < l/(fc — 1), fc > 2, it is NP-complete to decide 
whether a graph with m edges admits a /c-cut of size at least m- • (1 -l-e) [12]. 

The algorithm of Hofmeister and Lefmann relies though, on having the ex- 
plicit description of an RBIBD with appropriate parameters at hand. Since most 
known RBIBDs are ad-hoc constructions and algorithms for the construction of 
RBIBDs for a wide variety of parameters are rarely described in literature, it is 
not clear for which parameters the algorithm can be implemented. In fact, the 
only statement Hofmeister and Lefmann can make is that for n being a multiple 
of 3 and n,n— lorn — 2 being a prime power, a polynomial time algorithm 
exists for the construction of the desired RBIBDs. 

Using our construction of optimally universal hash families, we obtain now 
explicit implementations of their algorithm for all n and k being powers of the 
same prime. We assume a RAM model allowing arithmetics over natural numbers 
and finite fields of order at most n in constant time. 

Corollary 2. Ifn and k are powers of the same prime, then for any graph with 
n vertices and m edges a balanced k-cut of weight at least w{G) • • (1 -I- 

can be computed in time 0{k{n m) k ■ n^/gcd (n — 1, /c — 1)) . 

Note that this result is obtained by simply plugging our RBIBD construction 
into the algorithm of Hofmeister and Lefmann. We can even do better if we want 
to find /c-cuts with k > ^/n by a very similar and simple algorithm using OU 
hash families. 




A Construction Method for Optimally Universal Hash Families 



27 



Let G = (V,E) be a graph with n vertices and let H be an e-universal 
(-/V; n, fc)-hash family with universe V and range {1, . . . , k}. Any hash function 
h gH defines a cut (Vi, . . . , Vk) on the graph, by letting Vi = {v & V \ h{v) = i}. 
Moreover, if Ti is optimally universal, then the cut is balanced, because it is well- 
known that in this case = n/k (see e.g. [13]). Now choose a random hash 

function h from Ti and consider the cut (Vi, . . . , 14) defined by h. Since Ti is 
e-universal, each edge is a crossing edge with a probability of at least 1 — e. 
Hence, by the linearity of expectation, the expected cut size is bounded below 
by (1 — e)w(G). This means that there exists a hash function h G Ti which defines 
a cut having at least that size. Using e = eopt(n, k) = {n — k)/{k{n — 1)) if Ti 
is optimally universal, yields an edge weight of at least w{G) ■ (l — ) = 

rc(G).^.(l + ;^). 

As we will show in the remainder of this paper, the hash functions from the 
minimal OU hash family Ti claimed to exist in Theorem 1 can be enumerated in 
time 0{N), where N = {n— l)/gcd(n — 1, k — 1). Moreover, under the assump- 
tion of a RAM where (finite field) arithmetics can be done in constant time, 
the hash functions can be evaluated in constant time. Once we have picked a 
hash function h G Ti, we can evaluate the hash function values of all n vertices 
in time 0(n) in order to compute the corresponding cut (Vi, . . . , 14). Now, for 
each set U, the sum of weights of edges within Vi (that is edges e = {u, w} with 
v,w G Vi), can be computed in time 0{ri^ /k^^ (assuming that the graph is given 
by an adjacency matrix). This way, we can compute the sum s(Vi, . . . , 14) of 
weights of all non-crossing edges in time 0{v? jk) and the fc-cut with the mini- 
mal value s( 14 , . . . , 14) has the maximal weight w{G) — s(14 , . . . , 14) . Hence, the 
total running time is bounded by 0{N ■ /k) = 0{n^ /{k ■ gcd (n — 1, k-l))). 

For k > ^/n this is better than the result of Corollary 2. 

Theorem 3. If n and k are powers of the same prime, then for any graph 
with n vertices given by an adjacency matrix, a balanced k-cut of weight at least 
w{G) ■ • (1 -I- can be computed in time o{n^ / {k ■ gcd (n — 1, k — 1))^ . 

3 Partitioned Universal Hash Families 

In the following, we define a new type of hash families, called partitioned uni- 
versal. They are the main ingredients for our construction of optimally universal 
hash families. 

Definition 5. An {N;u,r)-hash family is partitioned e-universal if it is e- 
universal and if there exists an equivalence relation partitioning the universe 
in equivalence classes of size r such that any two distinct keys from the same 
equivalence class have a collision probability of 0. In the case e = 1/r the hash 
family is simply called partitioned universal. 

In the remainder of the text we use the convention that if the universe of a a 
partitioned universal hash family with range R is written as W x R, then the 
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equivalence classes to which the above definition refers to, are the classes Ux, 
X €W, where Ux = {(a;, a;') | x' G R}. 

While we have defined partitioned e-universal hash families mainly in order 
to prove Theorem 1, there are several examples of well-known e-universal hash 
families, which in fact turn out to be e-partitioned universal. One example where 
this is the case is the multiplicative (2/r)-universal hash family defined in [6]. As 
we will see in the following, partitioned universal hash families are quite easy to 
construct by means of another type of hash families, defined by Stinson in 1996. 

Definition 6 ([17]). Let H : U ^ R be an {N;u,r)-hash family, where R is 
an additive abelian group. H is ealled e-Z\-universal if for any two distinct keys 
xi,X 2 G U and any d € R, Prob^g-H {h{xi) — h{x 2 ) = d) < e. In the case 
e = Ijr, Ti, is simply called A-universal. 

Here is a well-known construction (see e.g. [17]): Denote by a finite field 
of order q and let U = (F^)" and R = (Fg)"* be extension fields of Fg. If 
(j) : U ^ R is & surjective homomorphism then it is easy to see that the family 
{fa I a G Fgn}, where fa ■ U ^ R, x (j){ax) is Z\-universal. If n < m, then 
any Z\-universal (q™; g™, g’”)-hash family is also Z\-universal for an arbitrary 
^"-element subset of the universe. Thus, one gets the following result. 

Lemma 1 ([17]). Let q be a prime power. For any positive integers n and m, 
there exists a A-universal {N; g”, q™)-hash family, where N = max{g”, g*”}. 

Using e-Z\-universal hash families, it is very easy to construct partitioned 
e-universal ones as the following lemma shows. 

Lemma 2. If there is an e-A-universal (N;u,r) -hash family, then there is also 
a partitioned e-universal {N; ur, r)-hash family. 

Proof. Let Ti. : U ^ Rhe a,n e-Z\-universal hash family where \U\ = u and |i?| = 
r. For each h G Hwe define the mapping fh'.UxR^R, (xi,X 2 ) h{xi) -\- X 2 . 
Then T = {fh \ h G H} is the desired partitioned e-universal {N]ur,r)-hash 
family. To see this, define the equivalence relation ~ on {7 x i? as (xi,X 2 ) ~ 
{x{,x' 2 ) x\ = x{. Clearly, two different keys {x\,X 2 ) and (xj, collide if and 
only if h{x\) — h{x{) = x'^ — X 2 . Because 77 is e-Z\-universal, this is for yf x\ 
the case with a probability of at most e. If a;i = then both keys belong to 
the same equivalence class and they differ only in the second component. Thus, 
they do not collide under any function in T . Since the size of each equivalence 
class is r, T is partitioned e-universal. □ 

It is well-known that ujr is a lower bound on the size TV of a (l/r)-universal 
(TV; M, r)-hash family (see e.g. [16]). This is clearly not a tight lower bound for 
the case r < u < r^, since for r < u, there is at least one key pair whose collision 
probability is not 0, and thus at least r hash functions are required to obtain a 
collision probability of at most 1/r. 

Definition 7. A (partitioned) universal (N;u,r) -hash family is called minimal 
if N = 1 in the case u < r and TV = max{u/r, r} in the case u > r. 
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Note that for u < r, a, minimal partitioned universal {N; u, r)-hash family is the 
set consisting of the identity, only. Using Lemma 1 and Lemma 2 for the case 
u > r, we obtain minimal partitioned universal hash families for all u and r 
which are powers of the same prime. 

Corollary 3. Let q he a prime power. For any positive integers n and m, there 
there exists a minimal partitioned universal {N;q'^,q'^)-hash family . 

We present now two construction methods, which can later be combined 
to recursively construct the minimal OU hash families from Theorem 1. More 
precisely, in the following two lemmas we show that we can combine a partitioned 
universal (W; u, r)-hash family either with an OU {N 2 ]r,r‘^ /u)-h.ash. family (in 
the case r < u < r^) or with an OU r)-hash family (in the case u > r^) 

in order to obtain an OU {N; u, r)-hash family. 

Lemma 3. Let u = k'^ ■ £ and r = k •£. Lf Q is a partitioned universal 
{Ni;k^£,k£) -hash family and T is an optimally universal {N 2 ] k£,£) -hash fam- 
ily, then there is an optimally universal {N;u,r)-hash family H, where N = 
NiN 2 {u - l)/gcd(fVi(r - 1), N 2 (u — r)). Moreover, if Q and T are both mini- 
mal, then so is hi. 

Proof. Let K and L be sets of cardinality k and £, resp. We may assume w.l.o.g. 
that Q consists of mappings KxR ^ R, where R = KxL, and T consists of map- 
pings K X L ^ L. Let z = lcm(Ni(r — 1), N 2 {u — r)) and zi = zj {fN\(r — 1)) 
as well as Z 2 = zj [N 2 {u — r)). Let the family Q' consist of zi copies of each hash 
function in Q and the family T' consist of Z 2 copies of the hash functions in T . 
For each function / G U IF' we define the mapping 



hf-.KxR^KxL, (xi,X2) 



f{xi,X2) iif&G', 
{xi,f{x2)) iifeR'. 



Finally, the hash family H consists of the functions hf with f G Q' U !F' . 

A somewhat technical computation (which we omit here due to space re- 
strictions) shows that the cardinality of H is in fact N and that Q and T being 
minimal implies that N = {u — l)/gcd(u — 1, r — 1) (i.e., H has the size of a 
minimal OU hash class). Hence, it remains to show that H is optimally uni- 
versal. Let c = U G'\, which is the probability, that a randomly chosen 

/ G tj' U IF' is an element of Q' . Hence, 



Z\Ni l/(r— 1) u — r u — r , , 

Z 2 N 2 -\-ziNi 1/(m — r) -I- l/(r — 1) r — 1 -|- u — r u—1 



Thus, by definition of optimally universality it suffices to show that any two 
distinct keys x = (xi, X 2 ) and x' = {x'i,x' 2 ) in Lf x i? have a collision probability 
of at most c/r. 

Assume first x\ = x\, thus X 2 x' 2 . Since in this case x and x' are elements 
of the same equivalence class with respect to the partitioned universality of Q' , 
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they do not collide under any function hj with f € ■ Under the condition that 

/ G IF', it follows from IF' being OU that hf{x) equals hf{x') with a probability 
of at most (r — €)j (£(r — 1)) . Therefore, we have for randomly chosen f ^G'\JT' 
a total collision probability of at most 

T — £ r— 1 r — £ r — £ rk — £k u — r c 

£{r — 1) u — 1 £{r — 1) £{u — 1) k£{u — 1) r{u — 1) r 

Let now xi yf x[. Under the condition that / was chosen from G' , the collision 
probability of x and x' is at most 1 /r. If on the other hand / was chosen from IF', 
then the keys do not collide at all, which follows straight from the definition of 
hf. Therefore, we again have a total collision probability (for / chosen randomly 
from G' U IF') of at most c/r. □ 

Lemma 4. Let u = kr and k > r. If G is a partitioned universal {Ni;u,r)~ 
hash family and T is an optimally universal {N 2 ]k,r)-hash family, then 
there exists an optimally universal {N;u,r)-hash family H, where N = 
NiN 2 {ur — r) /gcd(^Ni{u — r), N 2 {ur — u)). Moreover, ifG and T are both min- 
imal, then so is hi. 



Proof. Let K he a, set of size k and assume w.l.o.g. that G and T consist of 
mappings K x R ^ R and K ^ R, resp. Let 2 = lcm(A^i(M — r), N2{ur — u)) 
and zi = zj [Ni{u — r)) as well as Z 2 = zj [N2{ur — u)). Further, let the family 
G' consist of zi copies of each function in G and the family T' consist of Z 2 copies 
of each function in T . For each / G fj' U IF' we define the hash function 



hf-.KxR^R, (xi,X2) 



f{xi,X2) iif&G', 
f{xi) iffer. 



Finally, let H be the family of functions hf with f G G' hi tF' . 

A straightforward computation (which we omit here due to space restrictions) 
shows that \H\ = N and that N = {u — l)/gcd(rt — 1, r — 1) if ^ and T are 
minimal. It remains to show that hi is optimally universal. Let e = \T'\/\T' h G'\- 
Then 

Z2N2 l/{ur — u) u — r u — r 

ziNi~\-Z2N2 1 / {u — r) 1 / {ur — u) ur-u-\-u — r ur — r 



Thus, it suffices to show that any two distinct keys x = {x\,X2) and x' = (x(, X 2 ) 
va K X R collide under a randomly chosen function in hi with a probability of 
at most e. Assume first that x\ = x\ and thus X2 yf a:^. Then, by partitioned 
universality of G, x and x' do not collide under any function in G' , but under 
all functions in T' . Therefore, the collision probability is exactly e. Let now 
xi yf x[. Under the condition f £ G' the keys collide with a probability of at 
most 1/r. If on the other hand / G IF', then the collision probability is at most 
(fc — r)/(r(fc — 1)) since IF' is optimally universal. Therefore, the probability 
that hf{x) equals hf{x') for a randomly chosen f £ hi is bounded above by 



(1-c) 



1 

- + e- 
r 



k — r 
kr — r 



1 - 



1 u — r ulr — r 




u — r 



= e. 



□ 



ur — r 



ur — r 
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We can now combine the two construction methods in order to prove our 
main result. 

Proof of Theorem 1: Let q be an arbitrary prime power. We show by induc- 
tion on n that for all 1 < m < n there exists a minimal optimally universal hash 
family U ^ R, where U and R are sets of cardinalities u = and r = q"^, resp. 
In order to make clear, that this is not merely a proof of the existence of such 
a hash family, but rather an explicit construction, we describe such a minimal 
optimally universal {N; u, r)-hash family 

We use as the universe and range the extension fields U = (Fg)” and 
R = (Fg)™, resp. Let for all m < n the hash family Gnm consist of the 
functions fa '■ (Fg)””™ x (Fg)™ — > (Fg)"*, (xi,X 2 ) 4>{a ■ x\) + X 2 , where 

(f) : (Fg)"“™ ^ (Fg)'" is a projection from (Fg)"“™ to m arbitrary coordinates 
of the extension field. Note that for the computation of fa the multiplication is 
in the field (Fg)””*” while the addition is in (Fg)*”. According to the discussion 
at the beginning of this section Gf „ is minimal partitioned universal. 

If m = n, then contains only the identity id : (Fg)" ^ (Fg)", which 

is obviously minimal optimally universal. Hence, for n = 1 we obtain a trivial 
hash family Hl i- Let now 1 < m < n and assume that the hash families „/ 
have been constructed for all 1 < < n. In the case m < n < 2m, 

we choose Tif ^ as the union of Zi copies of Gn m Z 2 copies of all map- 
pings (Fg)”""* X (Fg)*” ^ (Fg)”"™ X (Fg)^™"”, {xi,X2) ^ {xi,f{x2)), with 

where z\ and Z 2 are the integers determined in the proof of 
Lemma 3. In the case n > 2m, we choose TLf ^ as the union of copies of Gf m 
and z'2 copies of the mappings (Fg)”"™ x (Fg)™ ^ (Fg)™, {x\,X2) ^ /(a^i), 
with / € Hn-m mi where z[ and z '2 are the integers determined in the proof of 
Lemma 4. Then, according to the proofs of Lemma 3 and Lemma 4 and by the 
induction hypothesis, Tif „ is minimal optimally universal. □ 

We finally remark that the hash functions from ^ may in fact be evaluated 
by simple finite field arithmetic. By the above proof it is easy to see that each 
hash function h e „ has the form 

gij{xi,...,Xn) 1 -^ {xi,...,Xi,f{Xi+i,...,Xj)), 

where either i = j and gtj is the identity, or i < j and / is a function in 
Gj_i^rn-i- Hence, h can essentially be evaluated by one finite field multiplication 
and one finite field addition. Although the evaluation of these hash functions is 
simple, there seems to be no obvious uniform algorithm allowing us to choose 
a function from T-Lf ^ in constant time. Nevertheless, it is easy to enumerate 
all hash functions in Tif „ in time \T-Lf ^|. This means also that the blocks of 
the corresponding RBIBDv[fc; u] can be constructed in linear time (with respect 
to the size of their description) ordered by the parallel classes (recall that a 
parallel class in the RBIBD corresponds to a hash function in the hash family). 
Therefore, the algorithm of Hofmeister and Lefmann can be used to compute a 
fc-cut in a graph as stated in Corollary 2 and our modified algorithm has in fact 
the running time stated in Theorem 3. 
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Abstract. The Sorting by Strip Moves problem, SBSM, was introduced 
in [6] as a variant of the well-known Sorting by Transpositions problem. 
A restriction called Block Sorting was shown in [2] to be NP-hard. In this 
article, we improve upon the ideas used in [6] to obtain a combinatorial 
characterization of the optimal solutions of SBSM. Using this, we show 
that a strip move which results in a permutation of two or three fewer 
strips or which exchanges a pair of adjacent strips to merge them into a 
single strip necessarily reduces the strip move distance. We also establish 
that the strip move diameter for permutations of size n is n — 1. Further, 
we exhibit an optimum-preserving equivalence between SBSM and the 
Common Substring Removals problem (CSR) - a natural combinatorial 
puzzle. As a consequence, we show that sorting a permutation via strip 
moves is as hard (or as easy) as sorting its inverse. 



1 Introduction 

The Sorting by Strip Moves problem, SBSM, was introduced in [6] as a variant 
of the well-known Sorting by Transpositions problem SBT [1, 4, 5, 7]. Let tt be a 
string which is a permutation of a finite set, say {1,2, . . . ,n}. A transposition 
is the operation of picking up a substring of tt and placing it elsewhere in the 
string. The SBT problem asks for a shortest sequence of transpositions that can 
transform tt into the identity permutation id„ = 1 2 . . .n. A strip move is a 
restriction of a transposition: the substring being moved must be a maximal 
substring of tt which is also a substring of id„. Such maximal substrings are 
called strips; for e xamp le, in the permutation 523416 on 6 elements, there 
are four strips [^, | 234 10 and [^. The Sorting by Strip Moves problem, SBSM 
asks for a shortest sequence of strip moves that can transform tt to id„. 

Analyzing and understanding strip moves combinatorially is worthwhile for 
two reasons. Firstly, strips are an important combinatorial object in undertand- 
ing transpositions. In particular, strip move sorting directly gives a factor 3 

* Part of this work was done when this author was with the Department of Mathe- 
matics, IIT Madras, India. 
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approximation for sorting by transpositions [6], and there are optimal transpo- 
sition sequences which do not, at any stage, break existing strips [3]. Secondly, 
a restriction, sorting by canonical strip moves (move a strip to a place where it 
merges with another strip) is shown in [2] to be NP-complete, while [6] shows 
that it is actually equivalent to SBSM. While [2] mentions various heuristic 
approaches to solving SBSM, and provides sufficient conditions under which a 
sorting sequence is optimal, [6] devises a 2-approximation algorithm for SBSM 
using certain combinatorial properites about optimal strip move sequences. 

In this paper, we continue the task of exploring the combinatorial structure 
of optimal strip move sequences. We show that the strip move distance s( 7 t) of 
a permutation tt equals the size of a largest compatible edge set in an associ- 
ated order graph Gt^ (Theorem 1). This allows us to establish the strip-move 
diameter (Corollary 2) for permutations of size n; note that the corresponding 
question for general transpositions is still unsettled. Using our characterization, 
we establish that certain strip moves are optimal: they necessarily reduce the 
strip move distance. In particular, a strip move that reduces the number of 
strips in the permutation by 2 or 3 is optimal (Theorem 2). Also, if two adja- 
cent strips are consecutive but out of order, then t he strip m ove t hat exch anges 
them is optimal (Theorem 3). Thus, for instance, s( 78 4 6 5 123 9)> 
s( pT8] |4 5 6| |l 2 3| 0) > s( pT8] 1 1 2 3 4 5 6| [^). 

We also establish the equivalence of Sorting By Strip Moves and the following 
combinatorial puzzle. Let tt,4> be a pair of permutations of the same length, 
viewed as strings, with tt 4>. We wish to quantify how different these strings are. 
One way to do this is the following: identify a substring common to both tt and 
4> and delete it. Repeat this process until both strings are the same. How many 
such deletions are needed? We denote the minimum number of deletions needed 
by csr{TT,(j)), and we define the Common Substring Removals problem, CSR, as 
the problem of computing csr(7r, <j)) for a given tt, (j). For example, consider the 
pair 7T = 3 5 1 4 2 and (j> = 1 A 3 2 5 from S'5. An optimal common substring 
removal sequence that makes this pair identical is (3 5 1 4 2, 1 4 3 2 5)^ 
(5 1 4 2 , 1 4 2 5) ^ (5, 5). We show that SBSM and CSR are computationally 
equivalent (Theorem 4). The reductions preserve optimal values; thus the 2- 
approximation algorithm of [6] for SBSM is also a 2-approximation algorithm 
for the CSR problem. An interesting consequence of this equivalence is that 
sorting a permutation via strip moves is as hard (or as easy) as sorting its 
inverse (Corollary 3). The analogous result for transpositions is quite obvious, 
but, given the way strip moves are defined, there is no direct way of seeing this 
for strip move sorting. 

Some of our results are subsumed by results of [2, 6], but the proof technique 
used here is completely different. 

Due to space constraints, some proofs are omitted. See [8] for details. 

2 Some Preliminaries 

We follow notation from [6]. We denote by [n] the set {1,2,..., n}, by the set 
of all permutations over [n], and by id„ the identity permutation on n elements. 
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For 7T G Sn, a strip of tt is a maximal common substring of tt and id„. A 
substrip is a substring of a strip. If a strip a is the substring u u + \ ... u + t 
for some u,t, then the strip containing u — 1 (or m + t + 1) is said to be the 
predecessor {successor, respectively) of a. Moving a to the immediate right of 
u — 1 or to the immediate left of u + t + 1 is called a canonical strip move. (If 
u = 1 or u + t = n, then moving a to the extreme left or right repsectively is also 
a canonical strip move.) The strip move distance s{tt) is the minimum number 
of strip moves needed to sort tt; the corresponding minimum if strip moves are 
required to be canonical is the canonical strip move distance sc(7t). 

Given a permutation tt, replace each strip x by any one representative ele- 
ment to obtain a sequence S. Now, replacing each a € S' by its rank in S, we 
obtain a permutation on k elements, where k is the number of strips in tt. This 
permutation is called the kernel of tt, denoted ker(7r). 

It is known that s(7t) = sc{n) = s(ker(7r)) [6] and that computing sc{n) is 
NP-hard [2]. 

3 Compatible Edge Sets and Optimal Solutions of SBSM 

Given a permutation tt G S„, we define an order graph associated with it. 
We then define a property, which we call compatibility, of subsets of edges of G^. 
We establish that optimally sorting a permutation tt by strip moves is equivalent 
to finding a largest compatible edge set in G,r- 

Definition 1. The order graph of a permutation it € Sn is the directed graph 
Gtt = (Er,£^ 7 r) where IG = [n] and = {(u,u) \ {u < v) A {n~^ < 7t“^)} = 
{(tT^TTj) I {i <j) A (7Ti < TTj)}. 

Definition 2. 1. Two edges (u,v) and {w,x) of Et^ are said to interleave by 

value ifu<w<v<x. They are said to interleave by position if < 
7r“^ < 7r“^ < 7 t“^. They are said to interleave if they interleave either by 
value or by position (or by both). 

2. An edge {u, v) is said to contain an edge {w, x) of if u<w<x<v 

(contain by value) or ^ (contain by position). 

3. For any G C E^r, the inclusion graph G(G, tt) corresponding to G is the 
directed graph {G,A) where A = {((m, w), (w, a;)) | {u,v) contains (w,a;)}. 

4-. A set G C is said to be compatible if no two edges of C interleave and if 
G{G, 7t) is acyclic. 

5. c,r is the size of any compatible set of edges of maximum cardinality. 

This definition of compatible edge sets generalises that of non-crossing sets, 
introduced in [6] to eficiently solve a certain variant of SBSM exactly. 

Note that if G is a compatible edge set, then the subgraph {[n],G) is a 
collection of vertex-disjoint directed paths. Also, if G is a compatible edge set 
and G' C G, then G' is also compatible. 

The main result of this section is that for all tt G Sn, n > 1, s(7t) = n — 1 — c,r- 
We need to establish several lemmas to finally prove this theorem. 
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We call (u,v) a unit edge ii v — u = — tt~^ = 1. Thus unit edges are 

exactly those edges of the order graph which link the adjacent elements of a 
strip. The following is easy to see. 

Lemma 1. Every compatihle set in Et^ of size contains all unit edges ofG^. 

Definition 3. Eor a permutation tt, let C be any compatible subset of E^^. A 
strip a of TT is said to be free with respect to C if every edge of C has neither or 
both endpoints in a. 

Lemma 2. Eor a permutation it, let C be a compatible subset of of size Ct^, 
and let a be a strip of tt free with respect to C . Moving a gives a permuta- 
tion a with c„ > Ctt- Eurthermore, if a is moved canonically ( to predecessor or 
successor) to obtain a, then c„ > c,r- 

Proof. Since a is a free strip, G is a subset of Ea- as well. Clearly, it has no edges 
interleaving by value. No edges interleave by new position either, since even in 
the new position no edge has just one endpoint in a. And the graph G{C, a) is 
acyclic as well, since the edges other than those in a form no cycles, and the 
unit edges within a do not contain any other edge by value or by new position. 
So G is a compatible subset of E^. Hence > |G| = c,r- 

Further, if a is moved canonically, then G„ has an extra unit edge created by 
this move, which is not even in G. So by Lemma 1, |G| < Ca- Thus > Ct^- □ 

Lemma 3. If a is obtained from tt via a strip /sub strip move, then Ca > Ct^ — 1. 
Eurthermore, if a is obtained from tt via a canonical strip move, then Co- > c,r- 

Proof Let G be a compatible set of edges in G^ of size c^. By Lemma 1, it 
contains all the unit edges of G^. We now show that there is a compatible edge 
set G' of cr with \C'\ > \G\ — 1. 

Let a be the strip or substrip moved in order to obtain cr from tt. Let m be 
the number of edges of G with one endpoint in a and one endpoint outside a. 
Clearly, m G {0, 1, 2}. If m = 0, then a is a free strip with respect to G, and the 
result follows from Lemma 2. So now assume that m > 0. 

Case 1: m=l. Let {u,v) be the single edge of G with an endpoint in a and an 
endpoint outside it. (If a is a proper substrip, then this is in fact a unit edge.) 
As argued in the proof of Lemma 2, G' = G \ {(u, u)} is a compatible edge set 
in Et^ and E^, showing that Ca- > — 1. 

Further, if a is moved canonically, then G^ has an extra unit edge created 
by this move, which is not in G' . So by Lemma 1, \G'\ < c„ and hence > Cj^. 
Case 2: m=2. Let a be the (sub)strip u+l,u + 2, ...,v+fc, with 7r“+^- = i-\- j 
for j G [k]. Since G contains all the unit edges of Gj^, the two edges straddling a 
must be of the form (m,u + 1) and (v-hk,w). Let G' = G\{(m,u + 1), (v-hk,w)}, 
and G" = C' LI {{u,w)}. As argued above, G' is a compatible edge set in and 
in Ea. (In particular, G(G', cr) is acyclic.) We now show that G" is a compatible 
edge set of Ga, proving that Ca > — 1. We only need to show that adding 

(u,w) keeps this set compatible. 
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C has a path p = u,v + l,v + 2, . . . ,v + k,w which is replaced in C" by two 
paths: a single-edge path p' = u,w and the path p" = v + l,v + 2, ... ,v + k; all 
other edges of C and C” are the same. Since no other edge of C can interleave 
with p by value, it is clear that no other edge of C” can interleave with paths 
p' and p” by value. Since the relative positions of all elements other than v + 
1, . . . ,v + k is unchanged, no edge of C and C" can interleave with p' by position 
either. So C” has no interleaving edges. 

We must now show that G{C” , a) is acyclic. Assume it is not; then any cycle 

0 in it must pass through the vertex (u, w), since we already know that G{C , a) 
is acyclic. Since the moved unit edges corresponding to a do not contain any 
edge by value or by position, none of the corresponding vertices can figure in 
6. Let {xi,yi) be predecessor and (x 2 ,y 2 ) the successor of (u,w) in 9. Then 
(xi,yi) must contain all edges of p in And either {u,v + 1) or {v + k,w), 
both edges of p, must contain {x 2 ,y 2 ) in E-k- The remaining edges of 9 are present 
in G(C', 7 t) as well. Thus, corresponding to 9, there is a cycle 9' in G{C,tt) as 
well, contradicting the compatibility of C. 

Further, if a. is moved canonically, then G„ has an extra unit edge created by 
this move, which is not in C” . So by Lemma 1, \G"\ < Ca and hence Ca > Ct^- □ 

Lemma 4. If tt is obtained from a by a strip move, then < Co- -I- 1. 

Proof. Given cr, let tt be obtained through a move of some strip a on cr. Then a 
is either a strip or a substrip in tt, and so, moving it back to its original place is 
a strip or substrip move from tt yielding cr. Now apply Lemma 3. □ 

Lemma 5. Given tt yf id„ and a compatible set C C of size Ct^, we can find 
a canonical strip move p on tt leading to a a with Co- = -I- 1. 

Proof. By Lemmas 2 and 4, it suffices to show that there exists a strip a free with 
respect to G. Consider the inclusion graph G{C, tt); it is acyclic by definition. All 
unit edges of have zero out-degree in G(G, tt). Let G' be the set of non-unit 
edges of G; C' is also a compatible set. If G' = 0, then G consists only of unit 
edges of G^^. Therefore, all strips of tt are free with respect to G. Otherwise, 
consider the subgraph G' of G(G, tt) induced by the vertex set G' . By heredity, 
G' is also acyclic. Choose any vertex (u,v) S C' of zero out-degree in G'; let 

1 = TT~^, j = TTf^. Let A be the set {m -I- 1, . . . , w — 1} U {tTi+i, . . . , ttj-i}. Since 

(u, v) is not a unit edge, A is non-empty. Since (u, v) is of out-degree zero in 
G' , it contains no non-unit edge of G. So the strips containing any element of A 
must all be free with respect to G. □ 

We now prove the main theorem of this section. 

Theorem 1. For all tt e S'„, n > 1, s(7t) = n — 1 — c,r = sc(7t). 

Proof. We will show that (i) s{tt) > n — 1 — c^, and (ii) sc{tt) < n — 1 — c^. Since 
s(7t) < sc(7t), the result follows. 

To prove (i), let pi, p 2 , . . . , Pk be a sequence of strip moves sorting tt. Let 
7T° = TT, and 7T* be the permutation obtained by applying strip move pi to 
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7T*“^. Let Ci denote c^i. By Lemma 4, a < Ci-i + 1. Since tt^ = id„, and since 
Cid^ = n— 1, we have n— 1 = Ck < Cfe_i + 1 < . . . < c\ + {k — l) < co + fc = c,r + fc, 
hence k > n— 1 — c,r- This holds for any strip move sorting sequence; in particular, 
it holds for an optimal sequence with k = s{tt). Hence s(7t) > n — 1 — c^. 

To prove (ii), repeatedly apply Lemma 5 and note that Cid„ = n — 1. □ 

Corollary 1. s(7t) = s(kerf7rj). 

If 7T G and 7T yf rev„ = n n — 1 . . . 2 1, then clearly > 1, and hence 
s(7t) < n — 2. For tt = rev„, c,r = 0, and so s(rev„) = n — 1. Thus we have: 

Corollary 2. D{n) = maa;{s(7r)|7r G S'„} = n — 1. 

Lemma 3 and Theorem 1 imply that canonical strip moves do not increase 
the strip-move distance. In the next two sections, we identify some canonical 
strip moves which provably reduce it. 

4 2-Moves and 3-Moves Are Provably Optimal 

In this section, we establish that strip moves which reduce the number of strips 
by 2 or more necessarily reduce the strip move distance. This result is proved in 
many stages. 

Lemma 6. If tt G S'„ has tt^ = u — 1 and TVi+i = v for some i and some 
u < V, and if the elements u,u+ l,u+2, . . . ,v — 1 form a strip a of tt, then the 
permutation a obtained from tt by moving a to between u and v has s(a) < s(tt). 

Proof. From Corollary 1, s(7t) = s(ker(7r)). In ker(7r), a is a single element. So 
it suffices to prove the lemma when |a| = 1, i.e. v = u + 1 and a = u. 

Let TT have tt^ = u — 1 and 7Tj_|_i = m -I- 1. By Lemma 2 and Theorem 1, it 
suffices to prove that there is some compatible edge set of size with respect 
to which the strip containing u is free. 

Let C be any compatible subset of of size c,r- If m is free with respect to 
C there is nothing to prove. Otherwise, without loss of generality, assume that 
k = TT~^ > i + 1] the situation where k < i is symmetric. There are three cases. 
Case 1: There is a single edge (m, w) in C incident on u. Then C has no edge of 
the form (x, u-l- 1), since such an edge interleaves by value with (u,w). If C does 
not have any edge (u— 1, a:), then let C = C\{{u, w)}U{(u— 1, u-|-l)}. Otherwise 
let (u — 1, a;) be an edge in C and let y be the rightmost endpoint of the path in 
C containing at -|- 1. Let C = C \ {(u, w), {u — 1, a:)} U {(u — 1, at -I- 1), (y, a:)}. It 
can be seen that C is a compatible subset of of size c,r- And u is free with 
respect to C' . 

Case 2: There is a single edge (w,u) in C incident on u. Then (at — 1, at -I- 1) 
cannot be in C. If C has no edge with right endpoint at -I- I , then the set C = 
C \ {(ac, at)} U {(at — I, at -I- I)} is a compatible subset of Et^ of size c,r, satisfying 
Case 1 above. Otherwise, let {x,u+ 1) be an edge in C. To avoid interleaving 
by value with (w,u), we have x < w. So (x,u + 1) contains {w,u) by value. 
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If 7r“^ < 7r“^, then {w,u) contains (x,u + 1) by position, creating a cycle in 
G(C, 7 t).So 7T“^ > TT~^. But if 7 t“^ < i, then (a;, u+ 1) and {w, u) would interleave 
by position. So the relative ordering of these elements must be x,u—l,u+l,w,u. 
Now the set C' = C\{(w,u), (a;,M+l)}U{(a:,M— 1), (m— l,M+l)}is acompatible 
subset of of size c,r, and u is free with respect to C'. 

Case 3: There are two edges touching u, (w, u) and (m, x). There can be no edge 
into u + 1 in C since any such edge will interleave with (u,x) by value. There 
can be no edge out of u — 1 in C since any such edge will interleave with (w, u) 
by value, unless w = m — 1. If w = m — 1, set C" = C \ {{u — 1, u), {u, a;)} U {(m — 
I,M+ I), (y,a;)} where y is the rightmost endpoint of the path in C containing 
u+l. Otherwise set C = G \ {(w, m), (u, a:)} U {(m — 1, m + I), (w, a;)}. Either way, 
C is a compatible subset of Et^ of size c-„, and u is free with respect to C'. □ 

Lemma 7. If a permutation tt € Sn has 7Ti_i = u — 1 and ttj = u for some 
u,i,j > i+l,u, and if the elements 7Ti+i, 7Tj_|_2, . . . , ttj-i form a strip a of tt, 
then the permutation a obtained from tt by moving a eanonically has s(a) < s(7t). 

Proof. Since s(7t) = s(ker(7r)), it suffices to prove the lemma when |a| = 1. 

Let TT have TTi-i = u—1, TTi+i = u, and tt^ = v. By Lemma 2 and Theorem 1, 
it suffices to prove that there is some compatible edge set of size c,r with respect 
to which the strip containing v is free. 

Let G be a compatible edge set of G,r of size c,r- If r" is free with respect 
to G, there is nothing to prove. If G has a single edge e touching v, then edge 
{u — 1,m) cannot be in G since it interleaves with e by position. The set C = 
G \ {e} U {(m — l,u)} is a compatible set of of size Ct^, and v is free with 
respect to it. If G has two edges {w,v) and (v,x) incident on v, then C = 
C \ {{w, v), (v, a;)} U {(w, x), (u — 1, u)} is a compatible set in G,r of size c,r, with 
respect to which v is free. □ 

Since the goal in SBSM is to move from an unsorted permutation towards 
id„ which has just one strip, it is natural to prefer strip moves that rapidly 
reduce the number of strips. With the results we have seen, we can formalise 
this intuition. Define an a:-move to be a strip move that results in a permutation 
with x fewer strips. (Note that a strip move cannot decrease the number of strips 
present by more than 3.) Since a 2-move satisfies the assumption of Lemmas 6 
or 7 and since a 3-move satisfies the assumption of both, we now have the main 
result of this section: 

Theorem 2. If a 2-move or a 3-move applied to tt gives a, then s(cr) < s(tt). 

5 Certain 1-Moves Are Provably Optimal 

We furnish another application of Theorem 1. We show that a strip move that 
exchanges a pair of adjacent substrings which are adjacent in the identity per- 
mutation can be part of an optimal strip move sorting sequence. Note that such 
a move may not even be a 2-move. To prove this, we establish an interesting 
property, which we call additivity , of the function s(7t). 
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Theorem 3. If a, P are adjacent strips in tt such that f3a is a substring o/id„, 
then (j) obtained from tt by exchanging a and (3 (which is a canonical move of a 
or (3) has s{4>) < s{tt). 

Proof Let the substring (3a he u + I u + 2 ... u + t for some u, t, and let 
it occur in positions t + 1, i + 2, i + t of 4 > for some i. If TTi = u or if 

TTi+t+i = u + 1 + 1, then exchanging a and /3 in tt is a 2-move or a 3-move, and 
hence, by Theorem 2, reduces the strip move distance. So now let us assume that 
TTi ^ u and TTi+t+i u -I- 1 -I- 1. Clearly, one way to sort tt is to exchange a and f3 
and then sort (f>\ thus s( 7 t) < l-|-s((^). We show that this is in fact an equality, by 
defining a notion of complete substrings and establishing a more general result 
concerning them. The proof follows from the statement of Lemma 8 below and 
the fact that aP forms a maximal complete substring in tt. □ 

Definition 4. A substring tt^+i . . . is complete if {71^+1 , . . . , TTj+i} = {u -I- 
1, M -I- 2, . . . ,u + t} for some u. That is, the t consecutive positions i + l,i + 
2 , . . . ,i + t hold t consecutive integers, though not necessarily in sorted order. 
If, furthermore, TTi u and TTi^t+i u + t + 1, then the substring is said to be 
maximal complete. 

The strip move distance for the complete substring tt^+i . . . TTi+t is defined to 
be the minimum number of strip moves needed to sort it i.e. to transform it into 
u + 1 . . .u + t. 

Lemma 8. If tt contains a maximal complete substring a with s(a) = q, then 
s(tt) = q + s(a), where a is the permutation obtained from tt by replacing a, in 
place, with its sorted version. 

Proof. Since a is maximal complete, it is clear that s{tt) < q + s(cr). So we 
only need to show that s( 7 t) > q + s(a). By Theorem 1, it suffices to show that 

Ca > q + c^- 

Let a = 7Ti_|_i . . . 7Ti_|_i where {71^+1 , . . . , TTj+i} = [u -|- 1, u + f\ for some u. Let 
C be any compatible edge set of of size • Partition C into four subsets as 
follows: 

Gi = {(u, w) I u, w G [m -I- 1, M -I- t]} both endpoints in [u -I- 1, u + t] 

C*2 = {(u, w) I 7r“^ < 7, w G [m -I- 1, u + f\} right endpoint in [u -I- 1, u + t] 
G3 = {(u, w) I u G [m -I- 1, u + t], 7r“^ > i + t} left endpoint in [m -|- 1, u + t] 

C 4 = C \ {Cl U G2 U G3) no endpoint in [m -|- 1, u + t] 

It follows from Theorem 1 that |Gi| < Cq, = t — 1 — g; i.e. t — 1 — |Gi| > q. 
If IG2I < 1 and IG3I < I, we construct a compatible set in of the required 
size as follows: Let G5 be the set of t — 1 unit edges obtained after sorting a; 
G5 = {{u+i, u-l-7-1-1) I i G [t— 1]}. Define C = G4UG5. Then C is a subset of E„, 
and it is easy to see that G' is compatible. If G2 U G3 = 0, then |Gi| -I- IG4I = c^, 
so G' is the desired set. Otherwise, Otherwise, G" = G' U {(u, m -I- I) | {v,u + i) G 
C 2 for some 7 G [t]} U {(77-I- 1, w) | {u + j,w) G G3 for some j G [t]} is the desired 
set; compatibility and size bounds are easily seen to hold. 
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If either IC 2 I or jCsj exceeds 1, we show how to construct another compatible 
subset of of size c,r, in which the corresponding subsets IC 2 I and jCsl are both 
of size at most 1. Then we can use the above argument. 

Assume that IC 2 I = a > 2. Let the edges of C 2 be ( 7 Tij, 7 rjJ, (TTijjTTj^), . . . , 
(TTi^jTTj^), where Ji < J 2 < • ■ • < ja- By compatibility considerations, we have 
la < • ■ • < *2 < *1 < ji < J 2 < ■ • ■ < ja- Let the unique path in C containing 
TT^y end at the right at TTfc^ . From compatibility considerations, it can be argued 
that {TTku,T^jH-^i) G We define \ he [a-1]}. 

If jCsl = 6 > 2, then a set C 3 of the same size is constructed in a symmetric 
way. Let the edges of C 3 be {iTp ^ , tt, J, (tt^^ , , . . . , (tt^^ where ph < ... < 

P 2 < Pi ■ By compatibility considerations, we have pb <■■■< P 2 < Pi < qi < 
Q 2 <■ - ■< qb- Let the unique path in C containing end at the left at 
Again, we can argue that the edges (j^ph+n'^rh) ^tre indeed present in and 
we define C'^ = {( 7 rpi, 7 r,J} U {( 7 rp,._^,, 7 rrJ | h G [6-1]}; 

If both a,b > 2, then it must be that ja < Pb, since otherwise there will be 
edges in C interleaving by position. 

Clearly, C 2 U C 3 is of the same size as C 2 UC 3 , and is disjoint from C 1 UC 4 . 
We now show that C" = Ci U C 2 U C 3 U C 4 is also a compatible subset of 

It is easy to see that none of the edges of C 2 U Cg interleave by position. 
Neither do edges of Ci U C 4 . And by our choice of C, an edge of C 2 U Cg will 
not interleave by position with an edge of Ci UC 4 . 

If edges of C interleave by value, then the corresponding edges of C can 
similarly be shown to interleave by value, since a is a complete substring. 

It can be argued that if the inclusion graph G(C', 7 r) has a cycle, then so 
does G(G, 7 t), contradicting compatibility of G. □ 

Note: The above result can be shown to hold even if a is complete but not 
maximal. 



6 CSR and SBSM Are Equivalent 

We first obtain an alternative formulation of SBSM and then show that this 
alternative formulation is equivalent to CSR. This then implies that sorting a 
permutation via strip moves is as hard (or as easy) as sorting its inverse. 

We say that an increasing substring Oi . . . aj of a string S = ai 02 ■ • ■ a™ 
of distinct elements from [n] is tight if there is no t < k < j — 1 such that 
Qk < ai < Ofc+i for some a/ in S. (In other words, if each ai in S is replaced by 
its rank r(a/) in S, then the substring r(oi) . . . r{aj) is a substrip in the resulting 
string. ) Let isr{S) denote the length of a shortest sequence of tight increasing 
substring removals on S that leaves behind an increasing subsequence of S. 

For example, consider 3 5 1 4 2. Neither of the increasing substrings (3, 5) 
and (1,4) is tight, and removing any one element does not make the sequence 
increasing, so isr{3 5 1 4 2) > 1. But after deleting 4, the sequence (3, 5) is tight, 
and deleting it leaves behind an increasing sequence 1 2. So isr{3 5 1 4 2) = 2. 

The following is easy to see. 
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Lemma 9. For any permutation tt , isr^ir) = s { tt ). 

Recall that csr{-K,(f)) denotes the length of a shortest sequence of common 
substring removals that makes the pair identical. Given any tt, G S'„, for any 
permutation i/' G S'„, is just a consistent re-labelling for (tt, 

Thus csr^TTjip) = csr{tp~^Tr,tp~^4>). In particular, csr{TT,4>) = csr(7r“^(/), id„) = 
csr{4>~^7T, id„). But if we consider csr(a, id„), the common substrings removed 
are always tight increasing substrings. Thus csr ( ct, id„) > isr{a). In fact, we 
have equality csr(cr, id„) = isr{a) because the tight increasing substrings re- 
moved from a are necessarily substrings of id„. These remarks, along with 
Lemma 9, establish the following theorem. 

Theorem 4. The Sorting by Strip Moves problem SBSM and the Common Sub- 
string Removals problem CSR are computationally equivalent. 

For 7T, (/) G Sn, csr{TT, 4>) = s{TT~^(j)) = s((^“^7t). For a G Sn, s(a) = csr(cr, id„). 

Corollary 3. For any tt G Sn, s { tt ) = s { tt ~^). 

7 Conclusion 

In this article, we have shown that certain types of strip moves are optimal; 
they necessarily reduce the strip move distance. Analogous results for sorting by 
transpositions would be very useful. In particular, identifying certain transpo- 
sitions that provably reduce the transposition distance could guide the way to 
efficiently sorting by transpositions, either optimally or with good approximation 
ratios. We believe that the techniques used in this paper can find application to 
the general sorting by transpositions problem as well. 
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Abstract. We consider a variant of Heilbronn’s triangle problem by 
asking for fixed dimension d > 2 for a distribution of n points in the 
d-dimensional unit-cube [0, 1]“^ such that the minimum (2-dimensional) 
area of a triangle among these n points is maximal. Denoting this maxi- 
mum value by A°J ^ (n) and (n) for the off-line and the on- 
line situation, respectively, we will show that < 

and for 

constants ci, cz, Gi, Cz > 0 which depend on d only. 



1 Introduction 

Given any integer n > 3, originally Heilbronn’s problem asks for the maxi- 
mum value A 2 {n) such that there exists a configuration of n points in the two- 
dimensional unit-square [0,1]^ where the minimum area of a triangle formed 
by three of these n points is equal to Z\ 2 (n). For primes n, the points Pk = 
1/n- mod n, k'^ mod n),fc = 0,l,...,n— 1, easily show that Z\ 2 (n) = I7(l/n^). 
Komlos, Pintz and Szemeredi [11] improved this to A 2 {n) = l7(logn/n^) and in 
[5] the authors provide a deterministic polynomial time algorithm achieving this 
lower bound, which is currently the best known. From the other side, improving 
earlier results of Roth [14-18] and Schmidt [19], Komlos, Pintz and Szemeredi 
[10] proved the upper bound A 2 (n) = for some constant c > 0. 

Recently, for n randomly chosen points in the unit-square [0, 1]^, the expected 
value of the minimum area of a triangle among these n points was determined 
to 6>(l/n^) by Jiang, Li and Vitany [9]. 

A variant of Heilbronn’s problem considered by Barequet [2] asks, given a 
fixed integer d > 2, for the maximum value A*Jji) such that there exists a 
distribution of n points in the d-dimensional unit-cube [0, l]'^ where the minimum 
volume of a simplex formed by some (d+ 1) of these n points is equal to A’^{n). 
He showed in [2] the lower bound A*^{n) = I7(l/n‘^), which was improved in 
[12] to A*^{n) = l7(log n/n'^). In [13] a deterministic polynomial time algorithm 
was given achieving the lower bound A‘^{n) = l7(logn/n^). Recently, Brass [6] 
showed the upper bound A*^{n) = for odd d > 3, while for 

even d > 4 only A*^(n) = 0(l/n) is known. Moreover, an on-line version of this 
variant was investigated by Barequet [3] for dimensions d = 3 and d = 4, where 
he showed the lower bounds and 17(1/71^^^/^"^), respectively. 
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Here we will investigate the following extension of Heilbronn’s problem to 
higher dimensions: for fixed integers d >2 and any given integer n > 3 find a set 
of n points in the d-dimensional unit-cube [0, 1]*^ such that the minimum area 
of a triangle determined by three of these n points is maximal. We consider the 
off-line as well as the on-line version of our problem. In the off-line situation the 
number n of points is given in advance, while in the on-line case the points are 
positioned in [0, 1]*^ one after the other and at some time this process stops. Let 
the corresponding maximum values on the minimum triangle areas be denoted 
by and respectively. 

Theorem 1. Let d>2 be a fixed integer. Then, for constants ci , C 2 , Ci , C 2 > 0, 
which depend on d only, for every integer n >3 it is 



(logn)i/(<i-i) 

n2lid-i) 

C2 

n2/(d-i) 



< Z\7-'“^(n) < 



Cl 

jj2/d 

C 2 

n^jd 



( 1 ) 

(2) 



The lower bounds (1) and (2) differ only by a factor of 0((logn)^/^‘^“^^). In 
contrast to this, the lower bounds in the on-line situation considered by Barequet 
[3], i.e. maximizing the minimum volume of simplices among n points in [0, l]'^, 
differ by a factor of 0(n^C . logn) for dimension d = 3 and by a factor of 
g)(^3i/24 .^ogn) for dimension d = 4 from the currently best known lower bound 
Z\J(n) = l7(logn/n‘^) for any fixed integer d > 2 in the off-line situation. 

In the following we will split the statement of Theorem 1 into a few lemmas. 



2 The Off-Line Case 



A line through points Pi,Pj G [0,1]^^ is denoted by PiPj. Let dist (Pi,Pj) be 
the Euclidean distance between the points Pi and Pj. The area of a triangle 
determined by three points Pi,Pj,Pk G [0,1]*^ is denoted by area {Pi,Pj,Pk), 
where area (Pi,Pj,Pk) := dist (Pi,Pj) ■ h/2 and h is the Euclidean distance 
from point Pk to the line PiPj. 

First we will prove the lower bound in (1) from Theorem 1, namely 



Lemma 1. Let d>2 be a fixed integer. Then, for some constant c\ = ci(d) > 0, 
for every integer n> 3 it is 



Aoff — line/ \ \ 

[n)>ci- 



(logn)^/(‘^ 

rPl0-\) 



(3) 



Proof. Let d > 2 be a fixed integer. For arbitrary integers n > 3 and a constant 
Of > 0, which will be specified later, we select uniformly at random and inde- 
pendently of each other N = points Pi, P 2 , . . . , Pn in the d-dimensional 
unit-cube [0,1]^^. For fixed i < j < k we will estimate the probability that 
area {Pi, Pj, Pk) < A for some value A > 0 which will be specified later. The 
point Pi can be anywhere in [0, l]'^. Given point Pi, the probability, that point 
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Pj G [0,1]^^ has a Euclidean distance from Pi within the infinitesimal range 
[r, r + dr], is at most the difference of the volumes of the d-dimensional balls 
with center Pi and with radii (r -|- dr) and r, respectively. Hence we obtain 

Prob (r < dist {Pi, Pj) < r -|- dr) < d ■ Cd • r‘^~^dr , (4) 



where throughout this paper Cd is equal to the volume of the d-dimensional unit- 
ball in Given the points Pi and Pj with dist (Pi,Pj) = r, the third point 

Pk & [0, satisfies area {Pi, Pj, P^) < A, if P^. is contained in the intersection 

Cij n [0, with Cij being a d-dimensional cylinder, centered at the line PiPj 
with radius 2- Ajr and height at most Vd. The d-dimensional volume vol {Cij fl 
[0, l]'^) is at most Cd-i ■ {2A/r)'^~^ ■ ^/d, and we infer for some constant > 0: 



Prob (area {Pi,Pj,Pk) < A) 

rVd 

< / vol (Cij n [0, l]'^) • d • Cd • dr 
do 

j-Cd f2 ■ A\ r- 

< J Cd_i • — j ■Vd-d-Cd-r‘^~^dr 



= Cd-i ■ Cd ■ 2^-1 • d^/2 . ^<^-1 . 




dr = C'a- . 



( 5 ) 



Definition 1. A hypergraph Q = {V,S) with vertex set V and edge set S is 
fc-uniform if \E\ = k for all edges E G S. 

A hypergraph Q = {V,E) is linear if \E C\ E'\ < 1 for all distinct edges 
E,E' G £. 

A subset I QV of the vertex set is independent if I contains no edges from 
£. The largest size \I\ of an independent set in Q is the independence number 

a{g). 



We form a random 3-uniform hypergraph Q = {V,S^) with vertex set V = 
{1,2,..., N}, where vertex i corresponds to the random point Pi G [0, l]'^, and 
with edges {i,j,k} G £3 if and only if area {Pi, Pj, Pk) < A. The expected 
number E{\£ 3 \) of edges satisfies by (5) for some constant Cd > 0: 



E{\£3\) < • C'd ■ A<^-^ < Cd ■ A<^-^ ■ . 



( 6 ) 



We will use the following result on the independence number of linear k- 
uniform hypergraphs due to Ajtai, Komlos, Pintz, Spencer and Szemeredi [1], 
see also [7], compare Fundia [8] and [4] for deterministic algorithmic versions of 
Theorem 2: 



Theorem 2. [1, 7] Let k > 3 be a fixed integer. Let Q = iy,£) be a k-uniform 
hypergraph on \V\ = n vertices with average degree t^~^ = k ■ |£|/|U|. Lf Q is 
linear, then its independence number a{Q) satisfies for some constant > 0.' 

a{G)>cl-j-{logt)^ . 



( 7 ) 
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To apply Theorem 2, we estimate in the random hypergraph Q = (V, £3) the 
expected number E{BPu^{Q)) of ‘bad pairs of small triangles’ in Q, which are 
among the random points Pi,. ■ . ,Pn G [0, those pairs of triangles sharing 
an edge and both with area at most A, where all sides are of length at least Dq. 
Then we will delete one vertex from each ‘pair of points with Euclidean distance 
at most Dq’ and from each ‘bad pair of small triangles’, which yields a linear 
subhypergraph Q** of Q and Q** fulfills the assumptions of Theorem 2. 

Let Pdo(G) be a random variable counting the number of pairs of distinct 
points with Euclidean distance at most Dq among the N randomly chosen points. 
For fixed integers j, 1 < * < J < N, we have 

Prob (dist {Pi, Pj) < Dq) < Cd ■ Dq , 

as point Pi can be anywhere in [0, 1]“* and, given point Pi, the probability that 
dist {Pi, Pj) < Dq is bounded from above by the volume of the d-dimensional ball 
with radius Dq and center Pi. For Dq := 1/iV^, where /3 > 0 is a constant, the 
expected number E{Pd^{Q)) of pairs of distinct points with Euclidean distance 
at most Dq among the N points satisfies for some constant > 0: 

E{Pdo {G)) < • Cd ■ Di < c'^ ■ . (8) 

For distinct points Pi,Pj,Pk,Pi, 1 < i < j < k < I < N, there are (2) choices 
for the joint side of the two triangles, given by the points Pi and Pj, say. Given 
point Pi, by (4) we have Prob (r < dist {Pi,Pj) < r + dr) < d-Cd-r'^~^dr. Given 
points Pi and Pj with dist {Pi,Pj) = r, the probability that the triangle formed 
by Pi, Pj,Pk, or by Pi, Pj, Pi, has area at most A, is at most the volume of the 
cylinder, which is centered at the line PiPj with height y/d and radius 2 • A/r. 
Thus, for d > 3 we have for some constant C^' > 0: 



Prob (Pj, Pj,Pk, Pi form a bad pair of small triangles ) 




For dimension d = 2 the expression (9) is bounded from above by C 2 ■ A'^ ■ 
log for a constant C 2 > 0. Thus, for each d > 2 we have 

Prob (Pj, Pj,Pk, Pi form a bad pair of small triangles ) 

< C'd ■ . fVd(d-2) . log iV , 
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and we obtain for the expected number E{BPdq{Q)) of such bad pairs of small 
triangles among the N points for a constant c^' > 0 that 

E{BPd,{G)) < • C'^ ■ . log TV 

< Crf • . j^A+P(d-2) . log TV . (10) 

Using (6), (8) and (10) and Markov’s inequality there exist N = points 
in the unit-cube [0, 1]*^ such that the corresponding 3-uniform hypergraph Q = 
(U, £3) on \V\ = N vertices satisfies 

\Ss\ < 3 ■ Cd ■ ( 11 ) 

PdAG)<3-c'^-N^-^^ (12) 

BPdo {G) <3- 4 • • TV^+/5(rf-2) . log TV . (13) 

By (11) the average degree = 3 ■ |£3|/|U| of ^ = (U, £3) satisfies := 

9 • Cd • A'^~^ • N^/N = 9 ■ Cd ■ A'^~^ ■ N'^. For a suitable constant £ > 0, we pick 
with probability p := N'^ /to < 1 uniformly at random and independently of each 
other vertices from V. Let U* C U be the random set of the picked vertices, 
and let G* = (V*,S^) with = £3 n [U*]^ be the resulting random induced 
subhypergraph of G- Using (11), (12), (13) we infer for the expected numbers 
of vertices, edges, pairs of points with Euclidean distance at most Do and bad 
pairs of small triangles in G* for some constants ci, C2, C3, C4 > 0: 

U(|U*|) =p. TV > Cl • N^/A^ 

E{\£;\) = / . IS3I <p^.i-Cd- -N^<C2- N^yA^ 

E{Pdo{G*)) = / • PooiG) < / • 3 • c'rf • iv2-/3d < C3 . TV^^-dd/^d-i 
E{BPdo{G*)) = / • BPooiG) < / • 3 • c'/ ■ ■ TV^+/3(<i-2) . bgTV < 

< C4 • -logiv . 

By Chernoff’s and Markov’s inequality there exists an induced subhyper- 
graph G* = {y*,£y of G such that the following hold: 



> (ci-o(l))-fV74l^ (14) 

1^3*1 <4-C2-1V^7^'^ (15) 

Pd^G*) < 4 • C3 • TV2"-^7tI^-i (16) 

BPooiG*) < 4 • C4 • TV^^+/5(rf-2) . loglV . (17) 



Now we fix a := 1/d and (3 := 1/d and for some suitable constant c* > 0 we set 

3 . „♦ (logn)^ 

41 . C * 2 

Lemma 2. For 0 < e < (d + 2) /(3 • d • (<^ + 1)) 

BPo^G*) = o{\V*\) . 



(18) 
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Proof. Using (14), (17), (18) and N = we have 



BPn,{G*) = o{\V*\) 

^4e+f3{d-2) . logTV = o{NyA^) 
j^3s+f3{d-2) . ^ 0(X) 

j^(l+a)(3e+/3(d-2))-l . (log rj) I = o(l) 

(1 + a) • (3£ + /3(d-2)) < 1 

d+2 

^ ^ 3 • d • (d + 1) 



as a = P = 1/d. 



Lemma 3. For 0 < £ < l/(d + 1) it is 



Pdo{G*) = o{\V*\) . 

Proof. By (14), (16), (18), using N = we infer 



Pdo(G*) = o{\V*\) 

j^2e-/ddj^d-l ^ I 

N^-d'^/A^ = o(l) 
n("-^^)(i+“)+V(logn)i/2 = o(l) 
(e — Pd)(l + a) < — 1 
1 



as a = P = 1/d. 



We fix £ := l/(3d), hence p < 1 for n > ng. In the subhypergraph G* = 
(V*,£^) we delete one vertex form each bad pair of small triangles with all 
side-lengths at least Dq and from each pair of vertices where the corresponding 
points have Euclidean distance at most Dq. The resulting induced subhypergraph 
G** = (V**,£^*) with £1* = [U**]3n£:3* fulfills by Lemmas 2 and 3 and by (14), 
(15): 

|U**| > (ci -o(l)) (19) 
15**1 < |5*| < 4 • C2 • N^yA^ (20) 

and the points corresponding to the vertices of the subhypergraph G** do not 
form any bad pairs of small triangles anymore, i.e. G** is a linear hypergraph. 
By (19) and (20) the hypergraph G** has average degree 



2 , ,2 12 • C2 • N^yA^ _ (12+o(1))-C2 ^ 2 , 

" ^ ■ (ci - o(l)) • Cl 
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The assumptions of Theorem 2 are fulfilled by the 3-uniform subhypergraph Q** 
of Q and we infer for t > 2 with (7), (19) and (21) for its independence number 



a(0) > a(g**) > ^ • (logt)!/" 



>c* 



|U* 






(logti)^/^ > 



> 



cS • (c?/" - 0(1)) • N- 



{{12 + o{l)) ■ 02^/^ ■ ■ A(d--L)/2 

> c' ■ {log 



^iogj^(i2+^p^y/\^, 



1/2 



as 



N = 



> c'- (1/c*)^ 



(log n)i/2 



• (logn)^/^ > n 



for some sufficiently small constant c* > 0. Thus the hypergraph Q contains 
an independent set I Q V with |/| = n. These n vertices represent n points 
in [0, l]*^, where every triangle among these n points has area at least A, i.e. 



3 The On-Line Case 



In this section we consider the on-line situation and we will show the lower bound 
in (2) from Theorem 1: 



Lemma 4. Let d>2 be a fixed integer. Then, for some eonstant C 2 = C 2 {d) > 0, 
for every integer n> 3 it is 



A 



on-line 

d 



(n) > 



C2 

77,2/ (d— 1) 



Proof. Successively we will construct an arbitrary long sequence Pi,P 2 j--- of 
points in the unit-cube [0,1]“^ such that for suitable constants a,b,a,(3 > 0, 
which will be fixed later, for every n the set Sn = {Pi, P 2 , ■ ■ ■ , Pn} satisfies 

(i) dist {Pi, Pj) > a/n^ for all 1 < i < j < n, and 

(ii) area {Pi, Pj,Pk) > b/n^ for all 1 < z < j < /c < n. 

Assume that a set Sn-i = {P\,P 2 , ■ ■ ■ , Pn-i} C [0, l]'^ of {n — 1) points with (i’) 

dist {Pi, Pj) > a/{n — 1)“ for all 1 < z < j < n — 1, and (ii’) area {Pi, Pj,Pk) > 

bj{n — 1)^ for all 1 < z < j < /c < n — 1 is already constructed. 

To have some space in [0, 1]'^ available for choosing a new point P„ G [0, 1]'’* 
such that (i) is fulfilled, this new point is not allowed to lie within any of the 
d-dimensional balls B^{Pi) of radius r = a/zz“ with center Pi, i = 1, 2, . . . , rz — 1. 
Adding the volumes of these balls yields 



n—1 

Y, vol {Br{P^)) <n-Cd-r‘^ = a‘^-Cd- . 
2=1 



For a := 1/d and a'^ ■ Cd < 1/2 we have '^ol {Br{Pi)) < 1/2. 
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We will show next that the regions, where condition (ii) is violated, also have 
volume less than 1/2. The regions, where (ii) is violated by points G [0, l]'^, 
are given by Ci^ n [0, l]*^, 1 < t < j < n— 1, where Cij is a d-dimensional cylinder 
centered at the line PiPj. These sets Cij n [0, are contained in cylinders of 
height Vd and radius 2 • 6/(n^ • dist (Pi,Pj)). We sum up their volumes: 



^ voi(Ci,,n[o,i]") 

l<z<j-<n— 1 



< ^ Vd-Cd-i- 



2-b 



nl^ ■ dist (Pi, Pj) 



d-l 



{2 ■ ■ Vd ■ Cd-i 

2 • 




1 



d-l 



(22) 



We fix some point Pi, i = 1,2, . . . ,n — 1. To give an upper bound on the last 
sum, we will use a packing argument, compare [2]. Consider the balls Br^{Pi) 
with center Pi and radius rt '■= a - t/n“, t = 0, 1, . . . with t < \/d- n°‘ja. Clearly 
vol (Pro (Pi)) = Oj and for some constant C)) > 0 for t = 1, 2, . . . we have 

VOl (Pr,(P.) \ Pr,_, (Pi)) < C: • ^ . (23) 

Notice that for every ball Br{Pj) with radius r = 0{n~°‘) and center Pj G 
Prt(Pj) \ Br,_^{Pi) with i ^ j we have vol (Pr(Pj) C (Prt(P*) \ Br,_^{Pi)) = 
6>(n““‘^). By inequalities (i’) we have n\ = 1 and by (23) each shell Prt(Pi) \ 
Prt_i (Pi), t = 2,11, ... , contains at most rit <C'^- points from the set Sn-i 
for some constant C'^ > 0. Using the inequality 1 -I- a; < a; G K, we obtain 



1 



dist {Pi,Pj) 



E 

\/d-n'^ / a 

< E 



d—l '/d-n°‘ ja 

< ^ Uf 



1 



a - {t— l)/n“ 



d-l 



< 



1 



a - {t — l)/n° 



d-l \fd-n°‘ja 

^ E : 






^d-1 



< C'^ ■ 



(24) 

for some constant C'^ > 0. We set fd := 2/{d — 1) and, using a = 1/d and (24), 
inequality (22) becomes for a sufficiently small constant b > 0: 

vol (Ci., n [0, 1]'^) 

l<i<j<n—l 

{2 ■ b)'^-^ ■ Vd ■ Cd-i ^ 



< 



< 



{2 ■ Vd ■ Cd-i 

j^f3{d—l) 



■E E 

1=1 j = 
n—1 

i=l 



1 

dist {Pi,Pj) 



< (2 • bf-^ -Vd-Cd-i ■ C'^ ■ < 1/2 . 
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The forbidden regions have volume less than 1, hence there exists a point 
Pn G [0, such that (i) and (ii) are satisfied, thus 

□ 



4 An Upper Bound 



Here we will show the upper bound 0(1 from Theorem 1 on the smallest 
area of a triangle among any n points in the d-dimensional unit-cube [0, l]*^. 

Lemma 5. Let d > 2 be a fixed integer. Then, for some constant C\ > 0, for 
every integer n > 3 it is 

^on-Hne(^) < f -Hue ^ ^ ^ 

Proof Given any n points Pi, P 2 , . . . , Pn G [0,1]^^, for some value Oq > 0 we 
form a graph Gdq = (U with vertex set V = {l,2,...,n}, where ver- 
tex i corresponds to point Pi G [0,1]^^, and edges {i,j} G if if and only if 
dist (Pi,Pj) < Dq. An independent set / C U in this graph Gdq yields a subset 
/' C {Pi, P 2 , ■ • ■ , Pn} of points with Euclidean distance between any two distinct 
points bigger than Dq. Each ball Br{Pj) with center Pj G [0, and radius r < 1 
satisfies vol {Br{Pj)f][0, l]'^) > vol {By{Pj)) /2'^ . The balls with radius Po/2 and 
centers from the set I' have pairwise empty intersection, thus 

a{GDo) ■ 2-^ ■ Cd ■ {Do/2f < vol ([0, 1]'^) = 1 . (25) 



By Turan’s theorem [20], for any graph G = (V,E) we have the lower bound 
a{G) > n/(2 • t) on the independence number a{G), where t := 2- |P|/|U| is the 
average degree of G. This with (25) implies 



4^ 

Gd-Di 



Tl 

> o(G'do) > 



t > 



Cd 

2-U 



■ n ■ D 



d 

0 • 



Let Dq := cjn^!'^ where c > 0 is a constant with > 2 - jCd- Then t > 1 
and there exist two edges {i,j},{i,k} G E incident at vertex i gV. Then the 
two points Pj and Pk have Euclidean distance at most Dq from point Pi, and 
hence area (Pi,Pj,Pk) < Dq/2 = 1/2- i.e. Z\[]'^^“*“®(n) = 0(1/71^/^^). 

□ 



5 Conclusion 

Certainly it is of interest to improve the bounds given in this paper. Also, for 
the off-line case it is desirable to get a deterministic polynomial time algorithm 
achieving the bound view of the 

results in [9] it is also of interest to determine the expected value of the minimum 
triangle area with respect to the uniform distribution of n points in [0, l]'^. 
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Abstract. In this paper, we investigate the maximum weight triangu- 
lation of a point set in the plane. We prove that the weight of maximum 
weight triangulation of any planar point set with diameter D is bounded 
above by {(2e + 2) ■ n+ ^ —5{e+ 1)) D , where e for 0 < e < 5 is a 

constant and n is the numb er of points in the set. If we use ‘spoke-scan’ 
algorithm to find a triangulation of the point set, we obtain an approxi- 
mation ratio of 4.238. Furthermore, if the point set forms a ‘semi-lune’ or 
a ‘semi-circled’ convex polygon’, then its maximum weight triangulation 
can be found in O(n^) time. 

Keywords: Algorithm, Approximation, Maximum weight triangulation. 



1 Introduction 

Triangulation of a set of points is a fundamental structure in computational 
geometry. Among different triangulations, the minimum weight triangulation 
{MWT for short) of a set of points in the plane attracts special attention 
[3, 5, 6, 7]. The construction of the MWT of a point set is still an outstanding 
open problem. When the given point set is the set of vertices of a convex poly- 
gon (so-called convex point set), then the corresponding MWT can be found 
in 0{n^) time by dynamic programming [3,5]. There are many approximation 
algorithms for various MWTs. A constant ratio of MWT of a point set in the 
plane was proposed in [7], However, the constant is so large that its exact value 
is not known. 

On the contrast, there is not much research done on maximum weight trian- 
gulation {MAT for short). From the theoretical viewpoint, the maximum weight 
triangulation problem and the minimum weight triangulation problem attracts 
equally interest, and one seems not to be easier than the other. The study of 
maximum weight triangulation will help us to understand the nature of optimal 
triangulations. 
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The first work in the MAT [9] showed that if an n-sided polygon P inscribe 
d on a circle, then MAT{P) can be found in 0{n?) time. Recently, a factor 6 
approximation algorithm to find the MAT of a set of points in the plane was 
proposed in [4] . 

In this paper, we present in Section 3 a non-trivial upper-bound of the weight 
o f MAT{S) for a set S oin points in the plane. That is, uj{MAT{S)) < ((2 -|- 
2e) ■ n + + f ~ 5(e-|- l))Ds, where Ds is the diameter of S and e is any 

constant such that 0 < e < We also show that when e ^ 0, the upper-bound 
is tight. 

Applying a simple ‘spoke-scan’ approximation algorithm in [4], we obtain a 
triangulation T{S) with ratio =)4.238, which improves the previous 

ratio of 6. Furthermore, let P be a special convex n— sided polygon such that the 
entire polygon P is contained inside the circle with an edge of P as diameter. We 
call such a polygon as semi-circled. In Section 4, we propose an 0{n^) algorithm 
for computing the MAT{P) of a semi-circled convex polygon P. Recall that a 
straightforward dynamic programming method will take O(n^) to find MAT{P). 
This algorithm works for more general so-called ‘semi-lune’ convex polygon. 

1.1 Preliminaries 

Let S' be a set of points in the plane. A triangulation of S, denoted by T(S), is 
a maximal set of non-crossing line segments with their endpoints in S. It follows 
that the interior of the convex hull of S is partitioned into non-overlapping 
triangles. The weight of a triangulation T{S) is given by 

lv{T{S)) = 

^GT(S) 

where oj{i,j) is the Euclidean length of line segment sTsJ. 

A maximum weight triangulation of S {M AT(S)) is defined as for all possible 
T(S), uj{MAT{S)) = max{uj{T{S))}. 





Fig. 1. An illustration of semi-lune and semi-circled convex polygons. 



Let P be a convex polygon (whose vertices form a convex point set) and T{P) 
be its triangulation. A semi-lune convex polygon P is a special convex polygon 
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such that its diameter is the diameter of a regular lune and it lies entirely in 
the lune. (A regular lune is the intersection of two circles passing each others’ 
centers, i.e., the diameter of a regular lune is -\/3 • r for radius r.) A semi-circled 
convex polygon P is a special convex polygon such that its longest edge is the 
diameter and all the edges of P lie inside the semi-circle with this longest edge 
as diameter. (Refer to Figure 1). 

The following two properties are easy to verify for a semi-circled convex 
polygon P = {pi,P 2 , -,Pk, ...,Pn-l,Pn)- 

Property 1: Let PiPk for l<i<k<n he a,n edge i n P. Then the area 
bounded by piPk and chain {pi, ...,pk) is a semi-circled convex polygon. 

Property 2: In P, edge pfpj for 1 < i < j < n or 1 < i < j < n is shorter 
than piPn- The ‘spoke-scan’ approximation algorithm proposed in [4] can be 
described as follows. Let Ds = 'SiSj be the diameter of a point set S. The line 
extending Ds partition S into and (excluding Si and Sj). Let to{El) and 
oj{Ej) (respectively, oj{Ef) and oj{Ej)) denote the sum of the lengths of edges 
connecting every points in to Si and to Sj, respectively. These edges are 
called ‘spokes’. The algorithm first takes the spokes in the larger one of u>{El) 
and uj{Ej) (respectively, the larger one of oj{Ef) and uj{Ej)) as the first subset of 
the triangulation edges. Then, it uses Graham’s scan algorithm to complete the 
triangulation. It is shown that this triangulation has weight at least {n 1)-^. 
We shall use this algorithm in our paper. 

Property 3: The diameter of semi-lune convex polygon P belongs to the 
MAT{P). 

Proof: A brief idea of the proof is as follows. Suppose for contridiction that a 
MAT{P) contains edge piPk which crosses d iameter psPt (refer to Figure 1). 
Note in a regular lune that the longest edge in any triangle spanning on the 
boundary of and Ps (or pt) must be incident at Ps (or pt). Then, both the longer 
edge of psPi and ptPi and the longer edge of PsPk and ptPk must be longer than 
or equal to PiPk- Then, we can construct a new triangulation T(P) including 
diameter psPt an d the weight of T{P) would be large than that of MAT{P). 

□ 



2 Tight Upper-Bound 

for Maximum- Weight Triangulations 

The intuitive idea of proof is as follows. The area of a point set is bounded 
above by its diameter D (i.e., f-P^). In order to maximize the edge lengths of 
a triangulation of the set, one may wish that a triangulation contains as many 
long and thin (called ‘large’) triangles as possible. However, the area of a triangle 
and its perimeter are related (by Heron’s formula) . Thus, the number of large 
triangles with perimeter greater than 2D is limited above by the area of the 
point set. Consequently, we can bound the total edge-length of any MAT of the 
set through the estimation of the number of large and the remaining triangles. 
(Note that constant e is functioned as the shape parameter of a triangle.) 
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Let Ds denote the diameter of a finite point set S in the plane, let As denote 
the area bounded by the convex hull CH{S), and let Lg denote the perimeter 
of CH{S). Let constant e be in range 0 < e < i. 

It is known in [1,2] that the values of As and Lg are bounded above by f-Dg 
and TrDg, respectively. 

The following lemma demonstrates the relationship among the area of a tri- 
angle, Aa, the perimeter of the triangle, w(A), and the diameter of the triangle. 
Da- 

Lemma 1. For a given constant e: 0 < e < ^, if perimeter uj{A) satisfies: 
w(A) > (2 -I- 2e)Z?A, then the area Aa > eVl — • Da- 

Proof: Let a, b, and c be the lengths of the three edges of a triangle A. We shall 
first prove that if A a is minimized, then two of a,b, and c equal Da, and the 
third one equals 2e. 

Note that, when w(A) > {2 + 2e)DA and Aa is minimized, the equality: 

a-|-fo-l-c=(2-|- 2c)Da (1) 

must hold. Otherwise (i.e., a + b + c > {2 + 2 €)Da), we can always “shrink” the 
triangle to reduce its area while maintaining w(A) > {2 + 2c)Da and without 
reducing Da until the equality (1) holds. 

Let I = By Heron’s formula, A a = \/l{l — a){l — b){l — c). 

Without loss of generality, we fix a = Da- We claim that if A a is minimized, 
then 6 or c is maximized, that is, one of b and c equals Da - In fact, the following 
equality holds. 

(; _ b){l - c) = (W-b) + (l-c)?-iil-b)-{l-c))^) ^ {{2l-{b+c))^-{b-c)^) ^ 

Then, 

. 4 . . + f 

Since I and a are fixed, (6 + c) is also fixed. So when Aa is minimized, (b — c)^ 
in (2) must be maximized, that is, 6 or c must equal Da- 

Therefore, when A a is minimized, two of a, b, and c equal D triangle, and by 
a-|-6-|-c = (2-1- 2e)I?A, the third one equals 2eD triangie- 

By (1) and definition of ?, ? = (1 -I- e) • Da- Without loss of generality, let 
b = Da, we have A a = \/{^ + ^)Da • cDa ■ cDa ■ (1 — c)Da = eVl — A-D\- 
We now have A a > e\/l — for 0 < e < 5. □ 

In a triangulation of S, T{S), we call a triangle t ‘large’ if the perimeter of 
t, oj{t), is at least (2 -I- 2e) • Ds; and t is ‘small’, otherwise. By Lemma 1 and by 
the fact that the maximum area of the convex hull of a point set with diameter 
Ds is bounded above by [1,2], and note that Da is at most Ds, we have 
an upper-bound for the number of large triangles in T(S)- 

Lemma 2. T{S) may contain at most large triangles- 

Now we can derive the upper-bound of the length, uj{T), of T(S'): 
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Theorem 1. uj{T) < ((2 + 2e) • n + + f - 5(1 + e))Ds. 

Proof: Let m denote the number of large triangles. It is well known that T{S) 
contains at most 2n — 5 triangles. Thus, T{S) has at most 2n — b — m small 
triangles. It is clear that the perimeter of a large triangle is at most 3Ds, and 
the perimeter of a small one is less than (2 + 2e) • Ds by definition. If we add up 
the perimeters of all the triangles in T{S) as well as the perimeter of CH{S), 
Ls, we obtain a sum of the lengths equals twice of (^(T). Thus, 
w(T) < i((2 + 2e)(2n - 5 - m)Ds + 3mDs + Ls) 

< i((2 + 2e)(2n - 5 - + 3 + n)Ds 

(note that m < 0 < e < 5, and Ls < ttDs) 

= ((2 + 2e)Ds ■ n + (21^1=^ | _ 5(1 _|_ e))Ds- 

□ 

Remark 1: Note that the above upper-bound depends on parameter e. When 
e = 2, our bound is very close to an obvious upper-bound, 3n — 6 for Us = 1. On 
the other hand, when e is sufficiently small, the upper-bound in Theorem 1 is 
the best possible one with the measurement of Ds- This is because there exists 
a point set whose MAT has weight (2 — 'L^-)Ds ■ n — 3{Ds — (-\/3 -I- l)i5) for 
any constant 5 > 0. When S is sufficiently small, the weight is approaching to 
the upper-bound. 

In more detail for the claim above, let us consider the following point set. 




Fig. 2. An illustration of the tight upper-bound (part(a)), and a bad case for ‘spoke- 
scan’ algorithm (part (b)). 



We start to place three points in the vertices of a equilateral triangle, say 
si,S2, and S3. Let o be the center of the circle inscribing at AS1S2S3. Now, we 
place a group of ^ — 1 points evenly on line segment osi such that the distance 
from Si to the farthest point in this group is 6. Similarly, we place f — 1 points 
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on line segment 0 S 2 and ^ — 1 points on line segment 0 S 3 by the same manner. 
(Refer to part(a) of Figure 2.) To construct a triangulation, we connect all points 
in the neighboring groups by edges. Thus, we obtain a triangulation T' with n — 3 
short edges and 2n — 3 long edges, where the sum of the lengths of the shorter 
edges is no less than 3(5 and the sum of the lengths of the long edges is large than 
(2n — 3) • {Ds — -\/3(5). Then, the total length of the triangulation, uj{T'), is at 
least {2n-i){Ds-V36) + 3S. Then, lo{T') > 2nDs-2nV3S-3Ds + -5V36+3S 
= {2-^)Ds-n-3{Ds-{V^+l)S). □ 

Recall that the approximation ‘Spoke Algorithm’ produces a triangulation 
with length at least (n + 1 ) • 

If we apply this spoke algorithm to produce a triangulation of any point 
set, then by our upper-bound, we obtain a performance ratio as stated in the 
following theorem. 

Theorem 2. 4 -|- 4e for 0 < e < where T'{S) is produced by 

‘spoke-scan’ approximation algorithm and u>{T{S)) is the upper-bound in Theo- 
rem 1. 



Proof: 



^(T(S)) 



2n(l-He)Ds-K^^^^-hf-5(l-He))-Ds 



(4 -I- 4e) -I- where 



5(e) = + 7T — 14(1 -I- e) . When n is sufficiently large, we obtain the desired 

ratio. □ 



Remark 2: When the value of e is properly chosen, the 4-|-4e ratio will apply to 

2n(l+e)Ds + ( -Hf -5(l-ee))-Ds 

any n. To see this, let > < 4-|-4e, that is, , 

< (4 -I- 4e) for any n > 0. We have that 2(21^^=^ + f ~ ^(1 -be)) < 4 -b 4e after 
simplification. Solving this inequality for e, we have e > .05932390326. 

Therefore, when we apply ‘spoke-scan’ algorithm to any point set with n > 3, 
we have an approximation ratio of 4.238. □ 



3 Finding MAT of Semi-lune 

and Semi-circled Convex Polygons 

We first shall show an O(n^) dynamic programming algorithm for finding the 
MAT{P) of semi-circled convex polygon P with n vertices. 

Lemma 3. Let P = (pi,p 2 , ■■■,Pk, ■■■,Pn-iiPn) be a semi-circled convex polygon. 
Then, one of the edges: piPn-i and PnP 2 must belong to its maximum weight 
triangulation MAT{P). 

Proof: Suppose for contradiction that none of the two extreme edges: piPn-i and 
PnP 2 belongs to MAT{P). Then, there must exist a vertex for 2 < A: < n — 1 
such that both pipk and PnPk belong to MAT{P). Without loss of generality, 
let pk lie on one side of the perpendicular bisector of edge piPnj say the lefthand 
side (if Pk is on the bisector, the following argument is still applicable), the two 
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Fig. 3. 



edges piPk and PnPk partition P into three areas, denoted by i?, L, and C. Both 
L and R are semi-circled convex polygons by Property 1. (Refer to part (a) of 
Figure 3.) 

Let the edges of MAT{P) lying inside area L be (ei, 62 , Cfe-a) and let 
El = (ei,e 2 ,...,efc_ 3 ,PiPfc). Let denote the edges: p^, ■■■,PnPk-i)- 

Then, there is a perfect matching between El and E’^. This is because El and 
E’^ respectively triangulate the same area L U C, hence the number of internal 
edges of the two triangulations must be equal. Let us consider a pair in the 
matching (ei,PnPj) for 1 < i,j < k. It is not hard to see that ui{ei) < u>{n,j) 
because any edge in El is shorter than piPk by Property 2, any edge in is 
longer than PnPk, and PnPk is longer than piPk (due to pk lying on the lefthand 
side of the perpendicular bisector of piPn)- Therefore, uj{El) is less than uj{E*j^). 
Now, we shall construct a new triangulation, say T{P), which consists of all 
the edges in MAT{P) except replacing the edges of El by We have that 
ui{T{P)) > lo{M AT{P)), which contradicts the MAT{P) assumption. Then, 
such a Pk cannot exist and one of piPn-i and PnP 2 must belong to M AT {P). 

□ 

By Lemma 3 and by Property 1, we have a recurrence for the weight of 
a MAT{P). Let denote the weight of the M AT{Pi j) of a semi-circled 

convex polygon Pij = (pi,pi+i, ...,pj). Let denote the length of edge 

PiPJ- 






Lo{i,i + 1) 

max{uj{i,j — 1) +u>{j 



i = (* + 1) 

+ l,j) + -I- 1)} -I- otherwise 



It is a straight-forward matter to design a dynamic programming algorithm 
for finding the MAT(P). 

ALGORITHM MAT - PIND{P) 

Input: a semicircled convex n— sided polygon: (pi, ...,p„). 

Output: MAT{P). 
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Method: 

1. for i = 1 to n — 1 do 

a>(z, z + 1) = u>{i, i + 1) 

2. for I = 2 to n — 1 do 

3. for i = 1 to n — ^ do 

u;(z, i + 1) = max{uj{i, z + ? — 1) + uj{i + I — l,i + 1), 
uj{i + 1, z + /) + i^(z, z + 1)} +uj(z, i 1') 

4. Identify the edges of MAT{P) by checking the w’s. 

5. end. 

Since the loop indices i and I range roughly from 1 to n and each evaluation 
of uj{i,j) takes constant time, all w(z,j) for 1 < z, j < n can be evaluated in 
O(n^) time. If we record these w’s, we can find the edges in MAT{P) by an 
extra 0(n) time to examine the record. 

Therefore, we have the following theorem. 

Theorem 3. The maximum weight triangulation of a semi-circled convex n— gon 
P, MAT{P), can be found in 0(n^) time. 

Proof: The correctness is due to Property 1. It is clear that the number of 
executions of Step 3 dominates the time complexity. The number of executions 
is (zz — 3) + (zz — 4) + ... + 2 + 1) G O(n^). □ 

By Property 3 and Theorem 3, we have that 

Corollary 1. The maximum weight triangulation of a semi-lune convex n—gon 
P, MAT{P), can he found in O(zz^) time. 

4 Conclusion 

In this paper, we derived a non-trivial tight upper-bound of the maximum weight 
triangulation of a point set in the plane. We also used ‘spoke-scan’ algorithm 
to obtain a factor 4.238 triangulation for any point set in the plane, which 
improved the previous result: factor 6. We finally proposed an O(n^) dynamic 
programming algorithm for constructing the MAT(P) of a semi-lune or a semi- 
circled convex zz— sided polygon. 

It is interesting to see what is the range of the performance ratios of the 
spok e-scan algorithm. There is a good case in which the algorithm produces an 
optimal solution of (-j -I- i5)(rz -I- 1)) for the following point set. 

Given any constant i5 > 0, we place n points on the unit interval. Two of 
them locate at the ends of the interval and the rest are in interval (^ — (5, ^-|-i5). 
Then raise the points near the center slightly so that the n points form a convex 
polygon, and every point near the center is at most ^ -I- i5 to any of the end 
points. (Refer to Figure 4.) It is not difficult to see that any triangulation of 
these n points has length at most (-y + <5)(zz -I- 1)). 

There is a bad case in which the spoke-scan algorithm produces a solution 
with ratio 3 (refer to part (b) of Figure 2). For the point set in part (a) of 
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n -2 



Fig. 4. An illustration of the good case. 



Figure 2, we slightly perturb it so that the sum of the lengths of spokes from si 
is the largest among all the cases. We then arrange each group of j points into a 
convex position. We finally move points Sa and Sf, into positions so that during 
the scan stage, there is no long edge created from the internal points of these 
groups on the convex side. It is not difficult to see that the optimal solution is 
(2n — 3)(Z3s — ■\/3<5) + 36 and the spoke-scan algorithm produces a solution of 
'^nDs + 4:Ds + n6. The ratio is approaching to 3 for sufficiently small 6 and large 
n. 

It is still an open problem whether one can design an o{n^) algorithm for 
finding the MAT{P) for a general convex n— sided polygon P. 
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Abstract. An octree is a recursive partition of the unit cube, such that in each 
step a cube is subdivided into eight smaller cubes. Those cubes that are not further 
subdivided are the leaves of the octree. We consider the problem of coloring the 
leaves of an octree using as few colors as possible such that no two of them 
get the same color if they share a face. It turns out that the number of colors 
needed depends on a parameter that we call unbalancedness. Roughly speaking, 
this parameter measures how much adjacent cubes differ in size. For most values 
of this parameter we give tight bounds on the minimum number of colors, and 
extend the results to higher dimensions. 



1 Introduction 

Octrees are a fundamental data structure to store any three-dimensional data in such a 
way that search and insert operations can be performed efficiently. Therefore, octrees 
are frequently used, for instance in computational geometry and computer graphics (see 
e.g. [5]). 

An octree is a binary partition of the unit cube. More precisely, an octree is obtained 
by starting from a single cube, and recursively subdividing cubes into eight smaller 
cubes of equal size. The structure of an octree can be seen as a tree, where each node 
represents a cube in the subdivision process. If a cube is further subdivided, the cor- 
responding node in the tree has eight children. The cubes associated to children nodes 
are also referred to as sub-cubes of the cube that corresponds to the parent node. An 
example of an octree is shown in Figure 1(a). 

We say that two cubes in an octree 2 xe, face-adjacent iff they share a two-dimensio- 
nal face (that is, a face of one of the cubes contains a face of the other cube). Similarly, 
two cubes are called edge-adjacent iff they share a one-dimensional face, and vertex- 
adjacent iff they share a zero-dimensional face. Whenever the term “adjacent" appears 
without further qualihcation in the following, we refer to face-adjacency. 

The purpose of this paper is to analyze the problem of coloring the leaves of octrees 
(i.e., those cubes that are not further subdivided) using as few colors as possible, such 
that no two adjacent cubes receive the same color. Figure 1(b) shows a proper coloring 

* This work was initiated at the 1st Gremo Workshop on Open Problems (GWOP), held at Hof 
de Planis, Stels, Switzerland, Inly 8-11, 2003. 

** Supported by the joint Berlin-Ziirich graduate program “Combinatorics, Geometry, and Com- 
putation”, financed by the German Science Foundation (DFG) and ETH Zurich. 

K.-Y. Chwa and J.I. Munro (Eds.): COCOON 2004, LNCS 3106, pp. 62-71, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 
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(b) A proper 3-coloring. 



Fig. 1. An octree. 



of an octree using three colors. In all figures appearing in this paper we indicate the 
color of a cube by coloring its bottom face. 

In an octree, adjacent cubes may differ in size. To capture this property, we introduce 
a parameter that measures the unbalance. An octree is said to be k-unbalanced, if the 
edge lengths of neighboring cubes differ by at most a factor of 2^. A 0-unbalanced 
octree is called balanced. If there is no condition on the balance of adjacent cubes, we 
call an octree oo-unbalanced, or just unbalanced. As it turns out, the number of colors 
needed for coloring an octree depends on this parameter. 

1.1 Related Work 

The problem of coloring octrees has been formulated by Benantar et al. [1]. It is a 
natural generalization of the corresponding problem for quadtrees, which are the two- 
dimensional counterparts of octrees obtained by recursively subdividing a square into 
four congruent smaller squares. 

For quadtrees the coloring problem appears in the context of solving linear elliptic 
partial differential equations on shared-memory parallel computers [2]. The basic idea is 
the following. If the squares of a quadtree are colored with respect to vertex-adjacency, 
then the squares that belong to one particular color class are spatially well separated 
and can therefore be processed in parallel without conflict. They present a linear-time 
algorithm for coloring 1-unbalanced quadtrees with at most six colors. For unbalanced 
quadtrees an algorithm using at most eight colors is given by Benantar et al. [1]. 

Recently, Eppstein, Bern and Hutchings [3] proved several results on coloring quad- 
trees with respect to edge-adjacency. They prove that balanced quadtrees can be colored 
with three colors, and that unbalanced quadtrees can be colored using at most four col- 
ors. Both bounds are tight in the sense that some quadtrees require this many colors. 
They also improved the bound for coloring the squares of unbalanced quadtrees with 
respect to vertex-adjacency from eight to six colors, and constructed a quadtree that 
requires at least five colors. 

1.2 Our Results 

In this paper we mainly concentrate on octree coloring problems with respect to face 
adjacency, and quadtree coloring problems with respect to edge adjacency. 
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However, recursive subdivisions of cubes are obviously not restricted to dimensions 
two and three. Since most of our results naturally extend to cubes of arbitrary dimen- 
sion, we present them in a more general setting. The following definition provides the 
generalization of octrees to arbitrary dimensional structures, which we call 2‘^-trees. 

Definition 1. For d G IN, a 2‘^-tree is a data structure formed by starting from a single 
d-dimensional cube and recursively subdividing d-dimensional cubes into 2'^ smaller 
d-dimensional cubes of equal size. 

Let k be a non-negative integer. A 2'^ -tree is called fc-unbalanced iff any two cubes 
that share a face (that is, a face of one of the cubes contains a face of the other cube) 
are within a factor of 2^ in edge length. 

By coloring a 2‘^-tree we mean coloring its leaves, that is, those cubes that are not 
further subdivided. The coloring of a 2‘^-tree is called proper iff any two cubes that 
share a face are colored with different colors. We say that a 2^^-tree is k-colorable iff 
there exists a proper coloring for its leaves using at most k colors. 

The first observation that can be made is that a balanced 2‘^-tree is 2-colorable, for 
every c? € IN. Indeed, since all cubes in a balanced 2^^-tree have the same size, it can be 
represented as a hnite set of cells of the axis parallel integer grid subdivision in The 
cells of this subdivision can be colored with two colors, by coloring all cells that have 
even sum of their coordinates with color one, and all the others with color two. We will 
refer to this coloring as the coloring in checkered fashion. 

Our results are presented in Table 1. For each d and k, the corresponding entry c in 
the table gives the minimal number of colors needed to color a fc-unbalanced 2‘^-tree. 
This means that, on one hand, we provide an algorithm for coloring ^-unbalanced 2'^- 
trees with c colors and, on the other hand, we give an example of a /c-unbalanced 2‘^-tree 
which cannot be colored with c — 1 colors. 

The paper of Eppstein, Bern and Hutchings [3] already settles this question for 
d = 2, k = 1, and d = 2, k = oo. These two results are marked bold in the table. 



Table 1. Minimal number of colors needed to color a fc-unbalanced 2‘^-tree. 



d\k 
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4 
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4 
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4 
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4 
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<6 
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<6 
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<6 


<8 








9 



Obviously, the values in the rows of the table are nondecreasing. The same holds for 
columns, since by using the structure of a fc-unbalanced 2^^-tree which is not c-colorable 
one can easily construct a fc-unbalanced 2'^+^ -tree which is not c-colorable. These prop- 
erties set upper and lower bounds in the cases in which we could not determine the exact 
number of colors needed (empty cells). 
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2 Lower Bounds 

Bern, Eppstein and Hutchings [3] gave a neat algorithm to color any 1-unhalanced 
quadtree using at most three colors. Moreover, the color of each square in the quadtree 
can he set without knowledge of the subdivision. 

For 1-unbalanced octrees, it turns out that three colors are not enough. For the proof 
of this claim we need the following two lemmata. 

Lemma 2. Let A, B and C be congruent cubes such that both B and C share a face 
with A, and B and C share an edge. Moreover, let the cube A be further subdivided 
into eight cubes (see Fig. 2(a)). 

If these cubes are properly colored with three colors, then B and C have to get the 
same color, and the six sub-cubes of A that are adjacent to B or C have to be colored 
in checkered fashion with the remaining two colors. 

Proof. Suppose that the arrangement of cubes in Figure 2 is properly 3-colored. We 
denote the two sub-cubes of A that are adjacent to both B and C by A\ and A 2 . Since 




c 




(a) Configuration. 




(b) Forced Coloring. 



Fig. 2. Illustration for Lemma 2. 



B, Ai and A 2 are pairwise adjacent they all have to be colored with different colors. We 
can assume without loss of generality that B is colored with color 1, Ai with color 2, 
and A 2 with color 3. Since C is adjacent to both Ai and A 2 , it has to be colored with 
color 1. Hence, each of the remaining four sub-cubes of A that are adjacent to B or 
C has to be colored either with color 2, if it is adjacent to A 2 , or with color 3, if it is 
adjacent to Ai. That gives exactly the coloring of the six sub-cubes of A by colors 2 
and 3 in checkered fashion, as shown in Figure 2(b). □ 

Lemma 3. Let A, B, C and D be congruent cubes, such that all of B, C, and D share 
a face with A, and that B and C do not share an edge. Moreover, let cube A be further 
subdivided into eight cubes ( see Fig. 3 ). 

If these cubes are properly colored with three colors, then B, C, and D get the same 
color, and the eight sub-cubes of A are colored in checkered fashion with the remaining 
two colors. 
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(b) Forced Coloring. 



Fig. 3. Illustration for Lemma 3. 



Proof. Apply Lemma 2 twice, once to the cubes A, B, and D, and then to the cubes A, 
C, and D. □ 

Note that a collection of cubes can be colored in checkered fashion with two fixed 
colors in two different ways. Using the configurations from the last two lemmata as 
building blocks, we give a lower bound on the number of colors needed to properly 
color 1 -unbalanced octrees. 

Theorem 4. There exists a l-unbalanced octree that is not i-colorable. 

Proof. We show that the l-unbalanced arrangement of cubes presented in Fig. 4 cannot 
be colored with three colors. Some cubes of the octree are omitted, since they will not 
be needed for the analysis. However, they can be easily added without violating the 
1-unbalancedness. Note that the cubes that appear in the example are of three different 
sizes — we will refer to them as big, medium and small cubes. Denote the two big cubes 
that are subdivided by A and B, and denote the two medium sub-cubes of A that are 
subdivided into small cubes by Ai and A 2 , as shown in Fig. 4(a). 

Now, suppose there is a proper 3-coloring. First, we look at the two adjacent big 
cubes at the back of the arrangement (behind A and B). They have to be colored with 
different colors. We can assume without loss of generality that the top cube is colored 
with color 1 and the one below is colored with color 2. 

Then, apart from A there are three big cubes adjacent to the subdivided cube B. 
This arrangement enables us to use Lemma 3. Since we know that the cube behind B 
has color 1, the other two big cubes should also be colored with color 1. Further, the 
sub-cubes of B have to be colored in checkered fashion with colors 2 and 3. 

Finally, the sub-cubes of A (except Ai and A 2 ) together with the two big cubes 
behind and below A form the configuration from Lemma 2. Since the big cube behind 
A is colored with color 2, the other one should have color 2 as well. The six medium 
sub-cubes of A have to be colored in checkered fashion with colors 1 and 3. 

It remains to prove that this coloring is not proper. Consider the subdivided cube Ai 
and its three neighboring medium cubes that are shaded in Fig. 4(b). By Lemma 3 the 
medium cubes above and below Ai both have the same color. Similarly, the medium 
cubes above and below A 2 both have the same color. But, we already know that the two 
cubes above Ai and A 2 are colored in checkered fashion using colors 2 and 3, whereas 
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(a) The construction 




(b) Two configurations to apply Lemma 2. 

Fig. 4. A 1-unbalanced octree that is not 3-colorable. 



the two cubes below Ai and A 2 are colored in checkered fashion using colors 1 and 3. 
Contradiction. □ 



Theorem 5. There exists a d-unbalanced 2'^-tree that cannot be {d + l)-colored. 
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(b) Remaining colors are forced. 



Fig. 5. A 2-unbalanced quadtree that is not 3-colorable. 



Proof. We are going to provide examples of a 2-unbalanced quadtree that requires four 
colors and a 3-unbalanced octree that requires five colors. The latter example can be 
generalized to prove the statement in higher dimensions. 

First, we consider the 2-unbalanced quadtree as shown in Fig. 5. Note that some 
squares of the depicted quadtree are omitted, as they will not be needed for the analysis. 
They can be easily added without violating the 2-unbalancedness. 

Assume that this quadtree can be 3-colored. The three squares marked in Fig. 5(a) 
are pairwise adjacent, and, hence, have to be colored differently. If we proceed to color 
those squares for which the color is uniquely determined, we end up with the coloring 
depicted in Fig. 5(b). Obviously, this cannot be extended to the last square. Contradic- 
tion. 

Next, we exhibit a 3-unbalanced octree T that cannot be colored with four colors. 
The construction is shown in Fig. 6(a). 

Assume that this octree can be 4-colored. The four cubes marked in Fig. 6(a) are 
pairwise adjacent, and, hence, have to be colored differently. If we proceed to color 
those cubes for which the color is uniquely determined, we end up with the coloring 
depicted in Fig. 6(b). Obviously, this cannot be extended to the last cube. Contradiction. 

In the following, we use the octree T to construct corresponding examples in higher 
dimensions. First, we show how to construct a 4-unbalanced 2^-tree T' . Start with a 
4-dimensional cube C. A copy of T is then placed onto an arbitrary facet of C. We 
extend (“grow”) each of the 3-dimensional cubes in the copy of T to the 4-dimensional 
cube, such that it does not overlap with the cube C. 

The obtained conhguration of 4-dimensional cubes can be easily completed (by 
adding more cubes) to a 2'*^-tree T'. Since the octree T has only four levels of subdivi- 
sion, T' is a 4-unbalanced 2^-tree. 

The cubes in T' obtained from cubes in T are adjacent to one another the same way 
as in T, and in addition they are all adjacent to the cube C. Since T cannot be colored 
with four colors, T' cannot be colored with five colors. 



Coloring Octrees 



69 




(a) Initial 4-clique. 




(b) Remaining colors are forced. 

Fig. 6. A 3-unbalanced octree that is not 4-colorable. 



This construction can be repeated in the same fashion, each time increasing the 
dimension by 1. This way, for every d > 3 we obtain a d-unbalanced 2‘^-tree that 
cannot be (d + l)-colored. □ 
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3 Algorithms for Coloring 2'^-trees 

We proved that at least four colors are needed to color a 1-unbalanced octree. Some- 
what surprisingly, four colors are enough even for a 1-unbalanced 2‘^-tree of arbitrary 
dimension. 

Theorem 6. Every k-unbalanced 2'^ -tree is (2k -f 2)-colorable, for k > 0, d > 2. 

Proof. We color the cubes of the fc-unbalanced 2‘*-tree according to their size — all the 
cubes with edge length 2~^ are colored in checkered fashion using the colors (2£ mod 
2k + 2) and (2£ -f 1 mod 2k 2), for every non-negative integer £. 

It remains to be shown that this coloring is proper. Two cubes of different size that 
are colored with the same color have edge ratio at least 2^+^, and therefore they cannot 
be adjacent in a Ic-unbalanced 2‘^-tree. If two adjacent cubes have the same size then 
they are colored with different colors by definition of the coloring in checkered fashion. 
Hence, the coloring is proper. □ 

The upper bound we just obtained is tight in some cases, but not in general. The 
theorem below provides a better upper bound, if d < 2k. 

Theorem 7. Every 2'^ -tree is (d -f 2)-colorable, for d> 2. 

Proof. We follow the idea presented in [3]. First, we assign an arbitrary color to the unit 
cube. Then, we subdivide it recursively assigning colors to the newly obtained cubes — 
in step t we color all cubes of edge size 2“T Note that this way cubes are colored even 
if they are subdivided. Eventually, we obtain a coloring of all leaves of the 2‘^-tree. 

For each subdivision of a cube of color c into 2'^ sub-cubes, we color them as fol- 
lows. A pairwise non-adjacent half of them (one color class of the checkered 2-coloring) 
is colored with color c. After that each of the cubes in the other half has d neighbors 
of color c and at most d additional neighbors of arbitrary color (since at that moment 
no cube of smaller size has been colored). Altogether, there are at most d -f 1 different 
colors in its neighborhood. Hence, one of the d -f 2 colors can be used to color it. □ 
Note that the existence of a coloring for the case d = 2 (quadtrees) follows from 
the four-color theorem for planar graphs [4]. 

4 Open Problem s 

It remains to determine the exact number of colors needed to color 2-unbalanced octrees 
(Is it four or five?). We conjecture that four colors are enough. 

Another interesting direction is to investigate the octree coloring problems with 
respect to edge/vertex-adjacency, where to the best of our knowledge no results have 
been obtained so far. 
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Abstract. Bricks are polyominoes with labelled cells. The problem of 
whether a given set of bricks is a code is undecidable in general. It is 
open for two-element sets. Here we consider sets consisting of square 
bricks only. We show that in this setting, the codicity of small sets (two 
bricks) is decidable, but 15 bricks are enough to make the problem un- 
decidable. Thus the frontier between decidability and undecidability lies 
somewhere between these two numbers. Additionally we know that the 
codicity problem is decidable for sets with keys of size n when n — 1 and, 
under obvious constraints, for every n. We prove that it is undecidable 
in the general case of sets with keys of size n when n > 6. 



1 Introduction 

Tiling problems, including polyomino tilings, are widely studied in the literature 
(e.g. [1,2]). In this paper we study decidability properties of brick codes which 
are a natural extension of polyomino codes. 

Let A be a finite alphabet of labels. A brick is a partial mapping k \ 7? ^ A, 
where domfc is finite and connected. It can be viewed as a polyomino with its 
cells labelled with the symbols of A. If |A| = 1, there is an obvious natural 
correspondence between bricks and polyominoes. The set of all bricks over A is 
denoted by A^. 

Given a set of bricks X C A'^, the set of all bricks tilable with (translated 
copies of) the elements of X is denoted by A^. Note that we do not allow 
rotations of bricks. X C A'^ is a brick code, if every element of X'^ admits 
exactly one tiling with the elements of X . 

The effective alphabet of A C A^ is the set of all symbols that appear on 
bricks in A, i.e., Ufcejc ^(domfc). If k G A'^ is a square brick, then by lenfc we 
denote the edge length of k, i.e., y/|domfc|. 

Given a rectangular brick t G A'^, by we denote a brick obtained by 
stacking together p copies of t vertically, and then q copies of this compound 
horizontally. 

The problem of whether a given set of bricks (even polyominoes) is a code is 
undecidable in general. This can be proved by reduction from the Wang tilabilty 
problem (see [2,8]). The problem is open for two-element sets. In this paper we 
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consider sets consisting of square bricks only. We show that in this setting, the 
codicity of small sets (two bricks) is decidable, but 15 bricks are enough to make 
the problem undecidable. 

Note that the codicity problem is trivially decidable if the squares use just 
one label, i.e., when they are effectively polyominoes. Only singleton sets are 
codes then. Thus in the sequel we consider sets of square bricks with an effective 
alphabet of at least two symbols. 

Also note that apart from the single-label case, the decidability of sets of 
squares does not depend on the size of the effective alphabet, since a larger 
alphabet can be “simulated” with two symbols at the expense of the size of the 
squares. When arbitrary shapes are considered, shapes can be used to simulate 
labels, thus making brick decidability equivalent to polyomino decidability. 

In the final section of the paper we investigate the decidability problem for 
sets with keys, i.e., squares having different top-left “prefixes” of a given size. 



2 The Case of Two Square Bricks 

We now consider sets consisting of just two bricks, each of them being a square 
(in the geometrical sense). We show that there exists a simple algorithm to verify 
whether a given set of this kind is a brick code. 

Proposition 1. Let X = {k,l} C where domfc and domZ are squares, 
k ^ 1. Then X is not a brick code iff k and I have a common rectangular tiler, 
i.e., there exists a rectangle t € such that k,l € 

Proof. The “if” part is obvious. Now, if |domfc| = |domZ|, then k and I have 
identical shape and differ in their labelling, so X is obviously a code with no 
common tiler for k and 1. Thus we are left with the case, e.g., |domfc| < |domZ|. 

Assume that X is not a brick code. There exists y G such that y admits 
two different tilings with the elements of X. 

Take the leftmost cell in the uppermost row of y. If both tilings use the 
same square x G {k,l} to cover this cell, then remove x from y and repeat this 
procedure until y' is obtained such that the two tilings place different squares in 
the leftmost cell in the uppermost row of y' . In other words, we take y to be a 
minimal brick admitting two tilings over X . Call the tilings and Ti, according 
to the square being used to cover the cell specified above. 

Now this implies that k tiles the top-left corner of 1. Moving rightwards 
along the top edge of Z, we observe that when covers the remaining part of Z, 
it cannot place a tile higher than the top edge of Z, since we have started in the 
uppermost row. Thus the next tile to the right of k has to be aligned along the 
top edge. Since we already know that Z contains a copy of k in its top-left corner, 
we have another copy of k “along the way” (although Z may have actually been 
used). Continuing in this way, we observe that the two tilings will eventually 
arrive at a common right edge when they reach the lowest common multiple of 
the edge lengths of k and Z. 




74 



Malgorzata Moczurad and Wlodzimierz Moczurad 



ki . 


.. kn 


ki . 


.. h 


\ki k2 


■ kn—i 


ki . 


■ ■ kn 


ki . 


.. h 


ki+i ki+2 .. 


■ kn 



Considering the situation to the right of the first copy of I in Ti and denoting 
the columns of k by ki, k 2 , fcn we obtain ki = fci+i, = ki+ 2 , ■■■, kn-i = kn 
where i = (len ^)mod(len fc) (cf. figure above). If z = 0, then lenfc is the width of 
the common tiler. Otherwise this width is equal to z. 

The above argument can be repeated in vertical direction, thus giving a 
square that can be tiled with k or with 1. The size of the square will be the 
lowest common multiple of the sizes of k and 1. □ 

Note that the proof becomes trivial if the effective alphabet of X is just 
one symbol, since X is never a code then with, e.g., the unit square being the 
common tiler for k and I 

Example 1. Consider X = {k,l} containing two bricks depicted below. They 
have a common tiler t, hence X is not a code. 



a 


b 


a 


b 


a 


b 


b 


b 


b 


b 


b 


b 


a 


b 


a 


b 


a 


b 


b 


b 


b 


b 


b 


b 


a 


b 


a 


b 


a 


b 


b 


b 


b 


b 


b 


b 



a 


b 


a 


b 


b 


b 


b 


b 


a 


b 


a 


b 


b 


b 


b 


b 



Proposition 2. If k,l G have a common rectangular tiler, then they have a 
common square tiler. 

Proof. Assume that k, I G {t}^, where t is a rectangle of size px q. Let r be the 
lowest common multiple of p and q. Both len k and len I have to be multiples 
of r. Hence, taken as the common square tiler. □ 

Corollary 1. Let X = {k,V\ C AX^, where domk and dom? are squares. It is 
decidable whether X is a brick code. 

3 The Case of 15 Square Bricks 

We show that a Thue system can be reduced to a brick code problem with square 
bricks. Choosing a small Thue system with an undecidable word problem, we 
obtain a set of 15 squares, thus proving that the codicity of a set containing 15 
square bricks is undecidable. 

There exists a non-erasing Thue system with an undecidable word problem 
over a two-letter alphabet with just three relations. This is the smallest example 
known to us, due to Matiyasevich [5,6]. We can encode this system, including 
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two arbitrary input words v and w, in a set X containing 15 square bricks and 
having the following property: v and w are equivalent (in the Thue sense) iff X 
is not a brick code. This implies undecidability of the codicity problem for 15 
squares. 

Consider a finite alphabet X and a finite subset S C X* x E*. The elements 
of S are called relations. A Thue system is the pair (X,S). For u,v € X*, we 
write u v if there exists a relation (s, t) G S' or {t, s) £ S such that u = xsy 
and V = xty for some x,y £ X*. By =s we denote the transitive closure of ~s. 
The word problem is: given u,v £ X*, does u =s v hold? 

For a given Thue system (X, S) and two words u,v £ X* we construct a set 
Xs,u,v of square bricks over an alphabet A = XU {*, >} in the way shown below. 
Cells which are left blank in the diagrams should be labelled with *. 

— For each relation (si...s„,ti...tm) £ S we draw two bricks. Note that if one 
of the words is shorter, it is left-padded with * symbols: 



■A: 5i i*r ... k S^i 




k ...k ti k ...k tfji 


k ...k t\ k ... tAr tm 




k Si k ... :Ar Sn 



— For the two words u = u\...Ui and v = v\...Vj we draw: 









★ 0 lAr V\ k .. 


,. * Vj 


■A: t> :Ar Ul k . 


.. * Ul 









— For each symbol a £ X U {o} we draw (“rewriting bricks”): 



OC 
'k (X 



— Finally, we draw an additional square (a “filler”): 0 



★ tAr ★ Q; 



★ Q lA: tAr 
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Proposition 3. Let {LJ,S) be a Thue system and let u,v G S* . Let Xs,u,v be 
the set constructed as described above. The following equivalence holds: u =s v 
iff Xs,u,v is not a brick code. 

Proof. (=J>) We first present a sketch of the proof, and then give the details. 

A successful derivation in the Thue system corresponds to a series of bricks, 
stacked vertically in layers: one of the input words, bricks corresponding to the 
first derivation step, bricks corresponding to the second step, the other input 
word. Adjacent bricks have the same labels along level boundaries. Rewriting 
bricks are used to shift non-* symbols to the left and to rewrite symbols that 
are not changed at each step. 

Now note that a tiling described above can also be constructed using the 
symbol-rewriting bricks and the filler. This implies non-codicity of the set. 

To analyze the details of this construction, consider u =s v. Let uJ denote 
the sequence of symbols *W 2 * ... where lu = 0 JiUJ 2 -.-<^n G LJ*. Using the 
Thue derivation, we construct a shape as follows: 

1. Initial lines are the brick corresponding to u: 



■k > -k U 



2. If the derivation includes m' u” , there exists a relation (s,t) such that 
u' = xsy and u" = xty. Consider the cases: 

~ If |s| = |t|, a brick is added as follows: 



I I 

I I 



'★ [> -k X k S k V' 

I ^ 
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— If |s| < |t|, * symbols have to be added before a brick that corresponds 
to (s,t): 
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— If |s| > |f|, a brick is added in a similar way but * symbols are removed 
afterwards. 

Note that in each case after a brick corresponding to (s,t) is added, the 
bottom row corresponds to u” . 

3. If the bottom row corresponds to v, the construction is complete: 



I* [> * f I 



■k t> -k V 



Another tiling for the shape being constructed can be made with the rewriting 
bricks and the filler (cf. Example 2). 

(<1=) If u =s V does not hold, there is no word u' such that u' v, and 
consequently it is not possible the complete the construction as in case 3. □ 



Note that the reduction works even if the Thue system is erasing. A semi- 
Thue system could also be used with a minor modification (no symmetrical 
relation bricks). Matiyasevich and Senizergues give an example of a semi-Thue 
system over a three-letter alphabet with three relations that has, e.g., un unde- 
cidable individual accessibility problem (see [7]). This leads to a similar result. 



Example 2. The Thue system of Matiyasevich contains a relation of considerable 
length; thus it is not well-suited as an example. To clarify the idea of the Thue- 
to-brick reduction, we present a Thue system with short relations. Let S = 
{{ab, b), {aa, bb), {aa, a)}. We ask whether bab =s a. The obvious answer is yes, 
thus the set of bricks X for this Thue system is not a code. 

The set X consists of the following bricks: 



~ bricks corresponding to the rules: 



k a k b 

-k 'k ~k 'k 
kc k: kr k: 

k k k b 



k k k b 
k k k k 
k k k k 

k a k b 



k a k a 

k k k k 
k k k k 

k b k b 



k b k b 

k k k k 
k k k k 

k a k a 



k a k a 

k k k k 
k k k k 
k k k a 



k k k Qj 
k k k k 
k k k k 

k a k a 



— bricks corresponding to the input words: 
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— rewriting bricks and the filler: 



~k ~k d 
-k 'k -k 'k 
-k k: kc k: 
k d k k 



k k k b 

k k k k 
k k k k 
k b k k 



tAr ★ 'Ar t> 
k k k k 
k k k k 
t> 'Ar ★ 



k d 
k d 



* b 

* b 



* [> 
* o 



0 



The following derivation corresponds to a tiling shown in Fig. 1. Underlined 
subwords are the ones being substituted at each step. 

bob OM a 




□□□□□□□ 

□□□□□□□ 


□□□□□□□ 
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Fig. 1. Tilings corresponding to bob ~s bb ~s aa ~s a 
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Corollary 2. The codicity problem for sets containing 15 or more square bricks 
is undecidable. 

Proof. The reduction of a Thue system with three relations over a two-letter 
alphabet produces six bricks corresponding to the relations, two corresponding 
to the input words, six rewriting ones, and the filler. □ 



4 Decidability of the Codicity Problem 
for Sets with Keys 

Given k G A^, by pref„(A:) we denote the n x n top-left subsquare of k. If 
lenfc < n, we define pref„(fc) = k. We say that X C is a set with keys of 
size n, if Vfc, ? G X : (pref„(fc) = pref„(Z) ^ k = l). 

Proposition 4. Let X C A^ be a set with keys of size n. If all bricks in X 
have edge lengths greater or equal to n, then X is a code. 

Proof. Obvious, since each brick has a unique top-left corner. □ 

Proposition 5. Let n > 6. It is undecidable whether a set with keys of size n 
is a code. 

Proof Let M = (Q, X, F, <5, qo) be a Turing machine, where Q is a set of states, 
E is a tape alphabet with a special blank symbol F is a set of terminal states, 
(5 is a transition function and qo is an initial state. We assume that the tape is 
infinite to the right, and after the input word there are only blank symbols on 
the tape. We can encode a Turing machine M in a set Xm,w with keys of size 6 
over an alphabet X U QU {>, *} in the following way. 

— For each instruction from S of the form (g^, x, y, R, qj) (meaning: if x is read 
in state qi, write y, move to the right and change state to qj) we draw a 
brick: 



■kqi -k X 

*y * Qj 



— For each instruction {qi,x,y, L,qj) and for each symbol a G X: 



■k a k qi k a; 



k qj k a k y 
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— For the input word w = ai...an' 




— Rewriting bricks for each symbol a G S U {>} and for each state q G Q: 

■k a k q 

k a k q 

— Additionally: 

-k ~k 

* vl 0 

The set Xm,w is not a code if and only if M stops on the word w. □ 

Note that sets where no square is a prefix of another square are obviously 
codes. 

Corollary 3. The codicity problem for sets with keys of size n is decidable for 
n = 1 and undecidable for n > 6. 

5 Conclusions 

What we know about the decidability of the codicity problem for square bricks 
amounts to the following: 

— If the effective alphabet is trivial, i.e., only one label is used, the problem is 
trivially decidable. 
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— If the effective alphabet contains at least two symbols, we have decidability 
for two-element sets and undecidability for sets with at least 15 elements. 

— The problem is trivially decidable for sets with keys of size n with all bricks 
having edge lengths of at least n. 

— The problem is undecidable in the general case of sets with keys of size n, 
for n > 6. 

We are of course interested in finding exact frontiers between decidability 
and undecidability (cf. [4]), both with respect to the number of bricks (3. ..14) 
and the size of keys (2. ..5). 
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Abstract. The Ulam-Renyi game is a classical model for the problem of 
determining the minimum number of queries to find an unknown number 
in a finite set when up to a hnite number of the answers may be erro- 
neous/mendacious. In the variant considered in this paper, questions 
with q many possible answers are allowed, with q fixed and known be- 
forehand; further, lies are constrained by a weighted bipartite graph (the 
“channel”). We provide a tight asymptotic estimate for the number of 
questions needed to solve the problem. Our results are constructive, and 
the appropriate searching strategies are actually provided. As an extra 
bonus, all our strategies use the minimum amount of adaptiveness: they 
ask a first batch of nonadaptive questions, and then, only depending on 
the answers to these questions, they ask a second nonadaptive batch. 



1 Introduction 

In the (?-ary Ulam-Renyi game two players, called Paul and Carole, first agree 
on fixing an integer M > 1 and a search space U = {0,...,M— 1}. Then Carole 
chooses a number x* S U , the secret number, and Paul must find out a;* by asking 
the minimum number of ( 7 -ary questions. By a ( 7 -ary question we understand a 
list To, . . . ,Tq-i of pairwise disjoint subsets forming a partition of the set U. 
The parameter q is fixed beforehand. When presented such a list, Carole will 
answer by pointing out the set Tk, supposedly containing the secret number 
a;*. It is however agreed that Carole may give up to e mendacious (erroneous) 
answers. The integer e > 0 is fixed and known to both players. Intuitively, any 
( 7 -ary question asks “Which set among Tq,Ti, . . . ,Tq-i does the secret number 
Xt belong to?” and the answer is just an index k € Q = {0, 1, . . . , q— 1}, meaning 
that a;* belongs to Tk- 

We generalize the ( 7 -ary game in the following way. Before the game starts, 
Carole and Paul fix a function T : Qx Q ^ IMo = {0,1,2,...} such that 

* This work was partially supported by INTAS 00-738 
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r(i, z) = 0 for each i, and r{i,j) > 0 for each i yf j. The function F is meant to 
weigh Carole’s answers. More precisely, whenever Carole answers j to a question 
whose sincere answer is i, we say that Carole’s answer has individual weight 
r(i,j). Note that every sincere answer has weight 0. The parameter e is meant 
to bound Carole’s lies via the following new rule: 

Carole is allowed to give false answers, with total weight (up to) e. 

Thus, if k is the honest answer to Paul’s first question T = {Tq,Ti, . . . ,Tq_i}, 
(in the sense that a;* G Tk) then Carole can choose her answer in the set {j : 
e — r{k,j) > 0}. If Carole answers i, with i in this set, then her available 
weight becomes e — F{k,i). And the individual weight of her answer is F{k,i). 
By induction, if k' is the correct answer to Paul’s current question T', and e' 
is Carole’s currently available weight, then she can choose her answer in the set 
{j : e! — r{k' ,j) > 0}. If Carole answers j', then her available weight reduces to 
e' — r{k',j'), and the individual weight of this answer is F{k',j'). 

We think of T as the weighted noise-transition pattern on a channel carrying 
Carole’s answers. The q-ary Ulam-Renyi game with this sort of restriction on the 
patterns of Carole’s lies will be called the game over the channel F. Each pair 
(z,j) with F(i,j) > e stands for an impossible lie: indeed, if Carole answered 
j to a question whose sincere answer is i, already the individual weight of this 
answer would exceed the total weight e. 

The classical g-ary Ulam-Renyi game is a Ulam-Renyi game over a channel 
F, such that F{i,j) = 1 whenever z yf j (see [2]) . Let 

= min{U(t, j) : i yf j}, 

ELinik) = {{j, k) : F{j, k) = wEj, 

9-1 

ELn = U 

fc =0 

We call F a d-rightregular channel, if \EE^{k) \ = d for all k. Trivially, in this case 
l^min(^)l = l^minl/? ^^r each k = 0, . . . ,q — 1. To ease notation we shall write 
Wmin, Amin (fc), Amin for zc^jn , A A „ ( fc) , A A „ , respectively, whenever the channel 
F is clear from the context. 

For any choice of the parameters q, e, M, d, and for any d-rightregular channel 
A, let N^p’'^\M,e) be the minimum number of questions that Paul must ask in 
order to infallibly guess a number x* G {0,1,..., M— 1}, in the g-ary Ulam-Renyi 
game with lies, over the channel A, with total weight e. In this paper we deal 
with the dual problem of determining the largest possible M = M{q, e. A, n) such 
that for all possible choices of the parameters g, e, n, d, and for any d-rightregular 
channel A, there exists a strategy of size n for Paul to infallibly guess a number 
a;* G {0, 1, . . . , M— 1} in the g-ary Ulam-Renyi game with lies over the channel A 
with total weight e. We consider the case M = g'" (to = 0, 1, . . .) and prove that 
the exponent to = m{q,e,F,n) (hence, the quantity M) only depend on A via 
the parameter d. Further, all our strategies can be implemented by procedures 
which use adaptiveness only once. 



( 1 ) 

(2) 

( 3 ) 
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We generalize the results in [3], where optimal and quasi-optimal strategies 
(i.e., strategies whose length differs by at most one from the information theoretic 
lower bound) are introduced for all sufficiently large M and for d-regular channels 
r, satisfying r{i,j) G {0, 1, e + 1}. 

The Ulam-Renyi game was introduced independently in [11], [12] (also see 
[1]). There exist many generalizations of this game, see the surveys [7], [10], [ 8 ]. 
Two-batch strategies were introduced in in [4] [5] (see also [ 6 ]). In [9] the Ulam- 
Renyi game over an arbitrary channel F was studied under the hypothesis that 
F{i,j) G {1, e -I- 1} for all i yf j. 

2 Ulam-Renyi Game on a d-Rightregular Channel 

Suppose Paul and Carole first fix integers q >2, 0 < d < q, M > 1 and e > 0, 
together with a d-rightregular channel F. The search space is identified with the 
set U = {0, 1, . . . , M — 1}. Let Q = {0, 1, . . . , <7 — 1}. Carole chooses a number 
X* G U and Paul must guess it by asking as few as possible g-ary questions. 
Carole is allowed to choose wrong answers with total weight e on the channel 
F. We say that the current weight available to Carole is e — w if the sum of the 
individual weights of her previous answers equals w. 

At any stage of the game, when questions Ti, . . . ,T( have been asked and 
answers b* = bi, ... ,bt have been received (with bi G Q) Paul’s state of knowledge 
is represented by an (e -I- l)-tuple cr = (Aq, Ai,A 2 , . . . , Ag) of pairwise disjoint 
subsets of U, where for each t = 0, 1, 2, . . . , e A^ is the set of elements of U 
which could possibly coincide with x* supposing that the sum of the individual 
weights of Carole’s answers 61 , . . . , equals i. In particular, the initial state uq 
is given by ([/, 0 , 0 , . . . , 0 ). 

Fix j = — 1 and write k = bj+i. Assume Paul is in state aj = 

{Bq, . . . , Be). Then Paul’s new state aj+i = = (Co, ■ • ■ , Cg) resulting from 

Carole’s answer k to question Ty+i is given by 

Ci = (J (^Bi_p(^j f^'f n Tj) . 

{ieC:r(j,fe)<q 

By induction, Carole’s answers bi, . . . ,bt determine a sequence of states 

61 62 bt 

ao = fJ, fJi = O-qS (72 = (7^ : . . . , (7t = = (7 . 

A state (Ao, Ai, A 2 , . . . , Ag) is final iff the set Ao U Ai U A 2 U • • • U Ag 
has at most one element. By a strategy S with n questions we mean the q- 
ary tree of depth n, where each node v is mapped into a question T^, and 
the q edges ? 7 o,? 7 i, • • ■ ,Vq-i generated by w are, respectively from left to right, 
labeled with 0, 1, . . . ,q — 1, which represent Carole’s possible answers to T,^. 
Let »7 = ? 7 i , . . . , ? 7 „ be a path in S, from the root to a leaf, with respective labels 
bi, . . . ,bn, generating nodes v\, . . . ,Vn and associated questions , . . . , . 

We say that strategy S is winning for cr iff for every path q the state cr^ is final. 
A strategy is said to be nonadaptive if all nodes at the same depth of the tree 
are mapped into the same question. 
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3 An Upper Bound for the Size of the Search Space 



Using a result of [9] in this section we shall give an upper bound on the largest 
integer M such that Paul has a strategy of size n to infallibly guess Carole’s 
secret number in the Ulam-Renyi game over the channel F with total weight e, 
over a search space of cardinality M . Our result holds for arbitrary F and for 
all sufficiently large n. 

Theorem 1. [Dumitriu, Spencer] 

Fix integers q > 2 and e > 0 and let Q = {0, 1, . . . , <7 — 1}. Fix F : Q x Q ^ 
{0, 1, e -I- 1} such that F{i,j) = 0, iff i = j, for each i,j G Q. Let E = {(z, j) G 
Qx Q \ F{i,j) > 0}. It follows that, for all e > 0 there exists an integer no > 0 
such that for all n > no, if Paul has a strategy of size n to infallibly guess a 
number a;* G {0, 1, . . . , M — 1} in the q-ary Ulam-Renyi game with lies over the 
channel F, with total weight e, then 



M < 




Q 



n 




As a consequence, recalling the definition of zCmim ^min(^) ^min in (l)-(3) 
we have: 



Theorem 2. Fix integers q> 2 and e > 0 and let Q = {0, 1, . . . , 9 — 1}. Fix a 
function F : Qx INq such that F{i, i) = 0 for each i and F{i,j) > 0 for i yf 
j. Then for all e > 0 there exists an integer no, such that for all integers n > no, 
if Paul has a strategy of size n to infallibly guess a number X:^ G {0, 1, . • . , M—1}, 
in the q-ary Ulam-Renyi game with lies, over the channel F, with total weight e, 
then 




In particular, if F is a d-rightregular channel we have 




L ^min J 



Proof Let w = w^-^. Let F' : Qx Q ^ {0, w, e -I- 1} be defined by 



J F{i,j), whenever F{i,j) G {0,w} 
^e-l- 1 , otherwise. 



For any fixed M, e, n, if Paul has no winning strategy with n questions in a game 
over the channel F' , then a fortiori he has no winning strategy for the game over 
the channel F. In fact, in the channel F Carole can choose her lies more freely 
than in F' , whence Paul’s searching game becomes more difficult. Moreover, in 
the game over F' each one of Carole’s lies weighs exactly w. Since the total 
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weight of her lies cannot exceed e, the maximum number of mendacious answers 
is at most [e/wj . Let F” : Q x Q ^ {0, 1, e + 1} be defined by 

fl whenever r'(i, j) = w 

^ <''J> = |rUi)otherw»^ 

Trivially, the game over F' with total weight e is equivalent to the game over 
F” with total weight \e/w\ . Thus, for all n = 1,2,...,, we have 

M{q, e, F, n) < M{q, e, F' , n) = M{q, \_e/w\ , F" , n). 

Then from Theorem 1 we immediately get the desired conclusion. 

4 Optimal Strategies with Minimum Adaptiveness 
for M = 

In this section we prove the existence of a two-batch-strategy which matches the 
bound given by Theorem 2. The whole section will be devoted to the proof of 
the following theorem. 

Theorem 3. Fix integers e > 0 and q > 2 and let Q = {0, 1, ... ,9 — 1}. Fix 
an integer d G Q and a function F : Q x Q ^ TN satisfying F{i,i) = 0 and 
i G Q, Then for all e > 0 there exists an integer no, such 
that for all integers n > no, if 




then Paul has a strategy of size n to infallibly guess a number a;* € {0, 1, . . . , M— 
1}, in the q-ary Ulam-Renyi game with lies, over the channel F with total 
weight e. Moreover, this strategy involves minimum adaptiveness: Paul asks a 
first batch of nonadaptive questions and then, only depending on Carole’s an- 
swers to these questions, a second and final batch of nonadaptive questions. 

4.1 The First Batch of Questions 

To ease notation we shall write w for Let the function F' ■. Q x Q ^ 

{ 0 , w,w 1} be defined by 

p'u j) = / ^ 

' ^ w -I- 1 otherwise. 

Recalling the definition of m{q, e, F, n) we can write 

m{q, e, F, n) > m{q, e, F' , n). 

As a matter of fact, in the Ulam-Renyi game over the channel F', with total 
weight e, each lie has a weight that cannot exceed the weight of a corresponding 
lie in the Ulam-Renyi game over the channel F. 

^ Notice that, in fact, F defines a d-right regular channel 
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Trivially, any winning strategy for the game over channel F' with total weight 
e is also a winning strategy for the game over channel T with total weight e. 

Henceforth we shall concentrate our attention to the game over the channel 
r' , with total weight e. We shall prove that under the hypothesis (4), there is a 
two-batch strategy for Paul to infallibly guess a number in the set {0, 1, . . . , g'" — 
1} in the Ulam-Renyi game over the (d-right regular) channel F' with total 
weight e. 

The first hatch of questions is defined as follows: For each j = 1, 2, . . . , m, let 
Vj = (Djo,Dji,...,Djq-i) denote the question 

“What is the jth (g-ary) digit of x*?” 

More precisely, a number y € U belongs to Dj * iff the jth digit yj of its g-ary 
expansion y = j/i • • • j/m is equal to i. Let bj G {0, 1, . . . , g — 1} be the answer to 
question T>i. Let b = 6 i • • • bm- Starting from the initial state cr = ([/, 0, . . . , 0), 
Paul’s state resulting from Carole’s answers b\ - ■ - bm is the (e -I- l)-tuple = 
(Ho, Hi,..., He), where for alH = 0, 1, . . . , e, Ai = {y G U \ drfy.h) = i} 
and dr'{y,h) = r'{yi,bi). Therefore, H^ is the set of all g-ary m-tuples 

y having the following property: 

— There exists an integer j G {0, 1, ..., [t/wj } such that {i — jw) / {w + 1) 
is also an integer, and there exist two disjoint subsets of { 1 , 2 , .. . ,m}, say 
W = {ki,k 2 ,...,kj} and W = ■ ■ ■ J{i-wj)/iw+i)}, such that 

{ w u GW 

w+IuGW' 

0 otherwise. 



Thus, 



where 



1-4.1 = Z 



j=0 



W -I- 1 



(5) 



= 

' ’ {0 otherwise. 

We conclude that there exists a constant 0 < 7 < 1 such that, for all suffi- 
ciently large to , and for each z = 0 , 1 , . . . , e. 



1^.1 < 



TO 

[i/w\ 



gL*/H(i_^). 



(6) 



4.2 Intermezzo: Error Correcting Codes a la Gilbert 

Lemma 1. Let the parameters e,q,d, F,m, M and n be as in Theorem 3. Let 
w = and for each z = 0, 1, . . . , e, let Ai be a set satisfying (5)-(6). Then we 
have the inequality 
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( 2\_e/w\ 

I E 

/ V 2=0 



n — m 
i 



- E (7) 



Proof. By (4) there exists a constant 0 < /? < 1 such that, for all sufficiently 
large n, and for all m satisfying (4), we have n > m + (1 — /3) log^ n. It 
follows that 




(8) 



Claim 1. The following inequality holds: 



m < 







d 



e 

-W- 



(9) 



For otherwise (absurdum hypothesis), using (8), we would have 




contradicting (4). Our first claim is settled. 

Claim 2. There exists a constant 0 < 7' < 1, such that 

e 

E (10) 

j=e-w-\-l 



Indeed, Claim 1 together with (6) are to the effect that 

E E (|T |)gL*/“J(i-7)<^^(i-7)mL*J <(i-70?"^^ 

for some 0 < 7' < 1. This settles our second claim. 

To conclude the proof, recalling (6), there exist polynomials p and pi, such 
that 
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hie/w\ / _ \ \ 

< (e-w + i)|Ae-™| ( ^ 



( - -1)! 
'' L 10 J ^ 



p(n - to) + (1 - 7')g" 



(LfJ’ 






UJ-i 



. w . 



< 



( - - 1)! 
'' Lio J ^ 



-p(n — to) 



+(i - y)?””" 



<Pi(n-TO)g L*J +(i_y)g"— 

<yg""™ + (l-y)g”-™. 



( 11 ) 

(12) 

(13) 



(14) 

(15) 

(16) 



Here, 

— (11) follows from the monotonicity of the sizes of the sets Ai, together with 
Claim 2, 

— (12) follows from (6), upon noting that there exists a polynomial in (n — to) 

bounding (e - w + 1)(1 - 7 )gL^J-i ■ 

“ (13) trivially follows from the properties of the binomial coefficient, 

~ (14) follows from Claim 1, 

“ (15) follows because 

d 

— (16) follows choosing no such that for all n > no, n > to + log^ 

The proof is complete. 

Lemma 2. Let the parameters e, g, d, T, to, M, n, tCmin &e as in Theorem 3. Let 
w = tCmin Ai Satisfy (5)-(6) for each z = 0, 1, . . . , e. Then there exist disjoint 
sets, Ci,C 2 , of q- ary tuples of length n — m such that 

(^) icii>E:=o“i^d, 

(a) for all Xi,X 2 G Ci, dH(xi,X 2 ) > 2[^J + 1, 

(iii) IC 2 I > EEe-^+l 

(iv) for all xi G Ci,X 2 G C 2 , dH(xi,X 2 ) > [^\ + 1, 

where dni-,-) denotes Hamming distance between q-ary vectors. 

Proof. The sets Ci and C 2 will be constructed via the following greedy algorithm. 
Let TZ = {0, 1, . . . , g — I}””™. We first build the set Ci C 7^ by the following 
procedure: 






is bounded by a polynomial in n — m, 




90 



Ferdinando Cicalese, Christian Deppe, and Daniele Mundici 



1. Pick an arbitrary vector x gTZ and include it in Ci. 

2. Delete from TZ all vectors y such that y) < 

3- If \Ci\ <YhZo \^r\ go back to 1. 

Each time step 2. is performed, at most (" vectors are deleted 

from TZ. Lemma 1 guarantees that, as long as \C[ \ < YTiZo l^i|> can add new 
elements to €[■. indeed, there are more elements in the set {0, 1, . . . ,q— 
than in the union of and the set of deleted vectors. Once the set Ci has 
been constructed with its X)i=(T 1^*1 vectors, by Lemma 1 in TZ\C\ there still 
exist YTi=e-w+i l^il many vectors which have not been discarded during the 
construction of Ci . These vectors will constitute the set C 2 ■ By direct inspection, 
Cl and C 2 satisfy (ii) and (iv). The proof is complete. 

4.3 The Second Batch of Questions 

To conclude the proof of Theorem 3, we will show that starting from the state 
a second batch of n — to nonadaptive questions is sufficient to guess the secret 
number a;* in the Ulam-Renyi game over the channel F" with total weight e, 
where 

r"(i 7 ) = / ^ ^ 

^0 otherwise. 

The replacement of channel T' by channel T" makes the game easier for Carole, 
because the weight of each possible lie is reduced to w. Thus, if n — to questions 
suffice to find the secret number in the game over the channel F" starting from 
state cr’^, a fortiori n — m questions will suffice in the game over the channel F' . 

Let the encoding function 9 send all elements of Aj one-one onto q-axy 
tuples in Ci, and all elements of one-one onto q-ary tuples in C 2 . 

Let C = Cl U C 2 be the range of 9. Paul’s second batch of questions will ask 

What is the ith digit of the encoding 9{x^.)l for z = 1, . . . , n — to. 

Our constraints on the Hamming distance between any two g-tuples in Ci and 
C 2 are just to ensure that Paul will be able to identify a;*, no matter Carole’s 
mendacious answers. As a matter of fact, there are two possible cases: 

Case 1. The secret number a;* is an element of U jZq Aj. 

Then Carole can give at most [e/zcj mendacious answers. Any tuple x S C 
representing a number yf a;* will differ from x^ in at least 2[e/zcJ -I- 1 digits. 
From the tuple a of Carole’s answers, Paul will correctly guess a;*, as the only 
possible number corresponding to the tuple in C at minimum distance from a. 
Case 2. The secret number belongs to 

Then Carole must give sincere answers to all remaining questions. As a matter 
of fact, each lie weighs at least w and, under our standing hypothesis for this 
case, Carole’s available weight is less then w. Carole’s answers will necessarily 
coincide with the encoding x® of x*. Again, by choosing the tuple of C which 
is closest to the tuple of Carole’s answers, Paul will correctly guess the secret 
number. 
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In either case Paul must only find the tuple in C which is closest to the tuple 

of Carole’s answers. This tuple does correspond to Carole’s secret number x*. 

The proof of Theorem 3 is now complete. 
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Abstract. The transformation protocol can make two players share a 
secret key using a random deal of cards. A sufficient condition on the 
number of cards for the transformation protocol to succeed was known. 
However, it has been an open problem to obtain a necessary and suf- 
ficient condition. This paper improves the transformation protocol and 
gives a necessary and sufficient condition for the improved transforma- 
tion protocol to succeed. 



1 Introduction 

A random deal of cards can be used for players to share a secret. For example, 
Winkler [13] gave bidding conventions for the game of bridge whereby one player 
can send secret information to her partner. This idea was carried further so that 
two players can share a secret key using a random deal of cards [1] . Since then, 
several protocols using a random deal of cards have been developed; Fischer and 
Wright gave two important protocols called the “key set protocol [2,5]” and the 
“transformation protocol [3].” The properties of the key set protocol have been 
investigated extensively [6-12]. For instance, a necessary and sufficient condition 
on the number of cards for the key set protocol to succeed was known [7, 9] . 
On the other hand, concerning the transformation protocol, few results have 
been obtained so far. For instance, a sufficient condition for the transformation 
protocol to succeed was known [3]. However, it has been an open problem to 
obtain a necessary and sufficient condition. In this paper, we will address only 
the transformation protocol, and close the open problem above. 

The scenario is as follows. Two players Alice and Bob communicate publicly, 
while a passive computationally-unlimited eavesdropper Eve overhears all com- 
munication. Alice and Bob are assumed to use randomization, that is, they can 
flip private fair coins. Let n be a positive real number such that n = log 2 m for 
some integer m > 2 and hence m = 2”. Alice and Bob wish to share an n-bit 
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secret key v G {1, 2, 3, • • • , 2"(= m)} which Eve cannot learn. That is, they wish 
to share a value v G {1,2, 3, •••,2”} such that, given the information available 
to Eve, the (conditional) probability of u = ^ is exactly 1/2" for every £ G [1, 2"]. 

Let A = (1, 2, • • • , c?} be a deck of d distinct cards; an element in the deck A 
is a card. We call a subset H Q A oi A & hand. A sequence 5 = He) 

of three hands such that {Ha, Hi,, He} is a partition of A is called a deal. A deal 
S = {Ha, Ht; He) means that every card in A is dealt to Alice, Bob or Eve so 
that Alice, Bob and Eve have hands Ha, Hb and He, respectively, as in the case 
of usual card games. We call 7 = (a, 6 ; e) the signature of a deal S = {Ha, Ht; He) 
if a = \Ha\, b = \Hb\ and e = \He\, where \X\ denotes the cardinality of a set X. 

Fix a signature 7 = (a, b; e) with a,b > 1. For such 7 , we always fix the deck 
A = (1,2, •••,0 + 6 + e}. Then, there are exactly {°‘^a^'^) • deals having 
the signature 7 . Assume that Alice, Bob and Eve have their hands Ha, Hb and 
He, respectively, from a random deal 6 = {Ha, Hb; He) whose signature is 7 . 
As in the case of usual card games, all the cards in her/his hand are private 
to herself/himself. Given such a random deal 6, Alice and Bob wish to share a 
secret key: the goal is to design a protocol which makes Alice and Bob share an 
n-bit secret key that Eve cannot learn. We say that a protocol establishes an 
n-bit secret key exchange for a signature 7 = {a, b; e) if the protocol always makes 
Alice and Bob share an n-bit secret key uG{ 1,2,3,---,2"} for any random deal 
S = {Ha, Hb', He) having the signature 7 and any random result of flipping their 
coins. 

In this paper, we first improve the transformation protocol. Our “improved 
transformation protocol” is superior to the transformation protocol. That is, 
the improved transformation protocol establishes an n-bit secret key exchange 
for any signature 7 for which the (original) transformation protocol does. We 
then give a necessary and sufficient condition for the improved transformation 
protocol to establish an n-bit secret key exchange for a signature 7 = (a, 6;e). 
We thus close the open problem above. 



2 Preliminaries 

In this section, we define some terms, and describe the transformation proto- 
col [3] given by Fischer and Wright. Fix a signature 7 = (a, 6 ; e) with a,b > 1, 
and let S = {Ha, Hb; He) be a random deal having the signature 7 . 

A subset S' C Z\ of the deck A is called an {s,i, j) -portion relative to S if 
s = |S|, i,j > 1, and S contains exactly i cards from Alice’s hand Ha, exactly 
j cards from Bob’s hand Hb and exactly s — i — j cards from Eve’s hand He. 
We often omit the phrase “relative to 6 ” if it is clear from the context. An 
(s, t, j)-portion S is said to be complete if s = f + j, i.e. Eve has no card in S. 
Furthermore, an (s, i, j)-portion S is said to be partial if s > z + j, i.e. Eve has 
at least one card in S. A portion S is said to be opaque if Eve does not know 
anything about the location of the cards in S\He. Consider the case where Alice 
and Bob obtain an opaque complete (s, z, j)-portion S, i.e. an opaque (z + j, i,j)~ 
portion S. Since Eve has no card in S, Alice and Bob completely know the 
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owners of all cards in S, but Eve knows nothing about it. Therefore, from the 
portion S, Alice and Bob can share a log 2 (*^'’)-bit secret key. Thus, an opaque 
complete portion immediately brings Alice and Bob a secret key. 

A set C of pairwise disjoint portions relative to S is called a collection relative 
to S. We often omit the phrase “relative to i5” if it is clear from the context. 
We say that a collection C = {81,82, ■ ■ ■ , S'™} is opaque if Eve does not know 
anything about the location of the cards in 8i\He, 82\Hg, - , 8m \He- If Alice 

and Bob obtain an opaque collection C containing complete portions, then they 
can share a secret key. 

During any execution of the transformation protocol, Alice and Bob start 
with the initial collection Co = {^}, change Co into another collection Ci, change 
Cl into C 2 , and so on. They finally obtain an opaque terminal collection Ct- A 
collection Ci can be changed into another collection C^+i, 0<^<t— l,bya split- 
ting transformation or a combining transformation. A splitting transformation 
replaces an (s, t, j)-portion in the current collection with several smaller por- 
tions. A combining transformation replaces two (s, 1, l)-portions in the current 
collection with a single (s', 1, l)-portion for some s' < s. 

To simplify the notation, we hereafter denote by C the current collection 
which Alice and Bob maintain if it is clear from the context. Alice and Bob start 
with C = Co = {^}. We sometimes use C' to represent a collection resulting from 
the current collection C by some transformation. 

We first present how to apply a splitting transformation to C, i.e. how to 
split a portion S' in C. 

Splitting: An (s, i, j)-portion 8 with i j > 3 can be split so that several 
smaller new portions will be acquired. If f > j, then the splitting transformation 
proceeds as described below. If f < j, then the roles of Alice and Bob are reversed. 

1. Alice randomly partitions 8 into i sets S(, S 2 , • • • , S', each of size \_s/i\ or 
\s/i\, such that she has exactly one card in each set, and announces the sets. 

2. Bob says how many cards he has in each set announced by Alice. 

3. Each set in which Bob has at least one card is acquired as a new portion. 

Notice that any (s, 1, l)-portion cannot be split. For an (s, i, j)-portion S, 
we say that S is splittable if f -I- j > 3; and S is non-splittable ii i j = 2, i.e. 
i = j = 1. 

Alice and Bob repeat applying a splitting transformation to C until any por- 
tion in C cannot be split. Then each portion S^, 1 < ^ < m, in the current 
collection C = {Si, S 2 , • • • , Sm} is a (|S^|, 1, l)-portion. They next repeat ap- 
plying the following combining transformation to C, i.e. repeat combining two 
portions Si and 82 having the same sizes in C. 

Combining: Two (s, 1, l)-portions Si and 82 with s > 3 can be combined so 
that a new portion S' will be acquired. 

1. Alice randomly chooses an integer p G {1,2}. Let q = 3 — p. 

2. Alice constructs and announces a set T consisting of her card in 8 p, [s/3j — 1 
cards randomly chosen from Sp that are not hers, and [s/3j cards randomly 
chosen from 8q that are not hers. 
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3. Bob announces how many cards he has in T . 

(a) If Bob has no card in T, then Alice announces the set difference Sq \ T, 
which is acquired as a new portion S' = Sq\T. 

(b) If Bob has exactly one card in T, then T is acquired as a new portion 
S' = T. 

(c) If Bob has two cards in T, then Alice announces SpOT, which is acquired 
as a new portion S' = SpDT. 

As Alice and Bob repeat combining two portions in the current collection 
C, the sizes of the portions in C become smaller. Notice that (2, 1, l)-portions 
cannot be combined. When they cannot apply a combining transformation to 
C, they obtain a terminal collection C = Ct, the terminal collection Ct possibly 
contains (2, 1, l)-portions, which can be used to share a secret key. 

We are now ready to give the full description of the transformation protocol. 
Given a random deal 8 = {Ha,Hb; He), the transformation protocol proceeds as 
follows. 

Transformation protocol: 

1. The initial collection is C = Cq = {^}- 

2. Splitting is repeated as long as there is a splittable portion, i.e. an {s,i,j)~ 
portion with i + j > 3, in C: choose such a portion S' in C according to any 
prearranged rule, remove S from C, apply a splitting transformation to S, 
and add all the new acquired portions to C. 

3. Combining is repeated as long as there is a pair of (s, 1, l)-portions with s > 3 
in C: choose such two portions Si and S 2 in C according to any prearranged 
rule, remove both Si and S 2 from C, apply a combining transformation to 
Si and S 2 , and add the new acquired portion to C. 

4. From the terminal collection C = Ct, Alice and Bob share an n-bit secret 
key, where n is the number of (2, 1, l)-portions in Ct- 

We now describe the definitions of the “potential function” (j) and the constant 
W [3]. Let c = logg /2 2. The potential function <j){s,i,j) is recursively defined as 
follows: 

{ 2 if s = 2 and i = j = 1; 

(s-2)-" ifs>3andi = j = l; 

j(j>{\s/i'], 1, 1) if i > j and i >2; and 

(j){s,j,i) ifz<j. 

The constant W is defined as IF = ~ 2)“°. (One can show that 2.0356 < 

W < 2.0358 [3].) Using (j) and W, we present the sufficient condition given by 
Fischer and Wright as in the following Theorem 1. 

Theorem 1 ([3]) Let n he a positive integer, and let 7 = (a, 6; e) he a signature 
with a,h > 1. If 4>{a + h+e, a,h) > W + 2(n— 1), then the transformation protoeol 
establishes an n-hit secret key exchange for 7. 

The condition in Theorem 1, i.e. <j){a + b + e,a,h) > W + 2(n — 1), is a 
sufficient condition for the transformation protocol to establish an n-bit secret 
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key exchange for 7 = (a, 6; e). However, it is not a necessary condition in general. 
It has been an open problem to obtain a necessary and sufficient condition. This 
paper closes the open problem as in the succeeding section. 



3 Improved Transformation Protocol 

In this section, we first slightly modify the transformation protocol, and then give 
a necessary and sufficient condition for our improved transformation protocol to 
establish an n-bit secret key exchange for a signature 7 = (a,b;e). There are 
two main ideas behind the modification. In the remainder of this paper, all 
logarithms are to the base 2. 



3.1 Stopping Useless Splitting Transformations 

In this subsection, we explain the first improvement, i.e. stopping a “useless” 
splitting transformation; the idea behind the improvement is naive. 

We now explain a “useless” splitting transformation, as follows. Assume that, 
during the execution of the transformation protocol, Alice and Bob obtain an 
opaque complete splittable portion, say an opaque (3, 1, 2)-portion S. According 
to the transformation protocol, S is eventually split into a (2, 1, l)-portion S[ and 
a singleton set S'2 consisting of one card from Bob’s hand. Since the singleton set 
S '2 contains no Alice’s card, S'2 is discarded, and hence only the (2, 1, l)-portion 
S^ is acquired as a new portion. Comparing the original (3, l,2)-portion S and 
the acquired (2, 1, l)-portion S[, S is preferable to S[, because a log 3-bit secret 
key is distilled from S while only a one-bit secret key is distilled from S[. Thus, 
it is “useless” to split S. More generally, one can immediately notice that it is 
useless to split a complete portion, i.e. an (z -|- j, z, j)-portion for some z and j, 
which can be used to share a log (*~^^)-bit secret key. Therefore, we never split 
a complete portion in our transformation protocol. This is the idea behind the 
first improvement. The full description of our transformation protocol will be 
given in Section 3.3. 

In the remainder of this subsection, we introduce two functions tpc and z/^p, 
which will be used later to describe a necessary and sufficient condition for our 
transformation protocol to establish an zz-bit secret key exchange for a signature 
7 = (a,6;e). 

We first introduce a function 'tpc which maps a portion S (relative to a deal <5) 
to a nonnegative real number. (Strictly speaking, the variable of the function tpc 
should be a pair (S', i5) instead of S, but we write simply tpc{S).) The function 
tpc is called the completely pote'ntial fwnction. Intuitively, it means that a tpc{S)~ 
bit secret key can be distilled directly from a portion S. Remember that, from 
a complete portion, namely an (z -|- j, z, j)-portion, Alice and Bob can share 
a log (*“[''’)-bit secret key. Hence, we define the completely potential function 
tpc{S) = tpc{s,i,j) for an (s, z, j)-portion S as follows: 






log (*+■’) if s = z-kj; 
0 otherwise. 



( 1 ) 
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We next introduce a function ip-p which maps a portion S (relative to a 
deal 6) to a nonnegative real number. The function ^/>p is called the partially 
potential function. Roughly speaking, whereas a i/'c(<S')-bit secret key can always 
be distilled directly from a portion S, a i/’p(<S')-bit secret key may be distilled 
from S by splitting and/or combining. 

We first define the partially potential function ippiS) = 'ipp{s,i,j) for a com- 
plete portion S, i.e. an (s, i, j)-portion S with s = i + j. Since such a portion S 
is never split and never combined, we define 

V'p(s,*.j) = 0 ifs = i + j. (2) 

We then define ipp for a partial portion S, i.e. an (s, i, j)-portion S with 
s > i + j. We will define ftp for a partial non-splittable portion S in the suc- 
ceeding subsection; thus, we now consider a partial splittable portion S, i.e. an 
(s, f, /(-portion S such that s > i+j > 3. Note that such a portion S will be split. 
We recursively set tpp{S) to be the summation of 'ipp{S[),'ipp{S 2 ), • • • , ipp{S'^), 
where S[, S!^,- ■ ■ , are the portions acquired by some particular splitting trans- 
formation, called the “worst” splitting. Assume for the moment that i > j, i.e. 
Alice does not have fewer cards in S than Bob. Furthermore, assume that Alice 
does not have more cards in S than Eve, i.e. s — (f -I- /) > z, or s > 2f -|- j. As 
the “worst” splitting, we consider the case where Alice would split the portion 
S into z subportions so that Bob has exactly one card in every S[, 1 < £ < j. 
One may assume that S'.^, S' 2 , ■ ■ ■ , S[ are sorted in non-increasing order of their 
cardinalities. Note, in this case, that each of the acquired portions • • • , 5' 

is either a ([s/z], 1, l)-portion or a ([s/zj, 1, l)-portion. Note, furthermore, that 
each of the first j subportions S'(, • • • , S'' is a partial portion, i.e. |S^| > 3 

for every t, 1 < t < j, because Eve has z or more cards. We now count the 
numbers of (|"s/z], 1, l)-portions and ([s/zJ, 1, l)-portions. Let r be the remain- 
der when dividing s by z, that is, let r = s mod i. If r = 0, then there are 
exactly j {s/i, 1, l)-portions. If 1 < r < /, then there are exactly r (|"s/zl, 1, 1)- 
portions and exactly j — r ([s/zJ , 1, 1 (-portions. If j < r, then there are exactly 
j ([s/z] , 1, l(-portions. Thus, for an (s, z, /(-portion S such that s > z -I- / > 3, 
i > j and s > 2z -|- /, we define 

f/V'p(s/z,l,l( ifr = 0; 

i’p{s,i,j) = < ri/p([s/z],l, 1( -b (/ - r(z/>p([s/zj, 1,1( if 1 < r < /; (3( 

[ji’p{\s/i^,l,^) if/<A 

where r = s mod z. Next assume that Alice has more cards than Eve, i.e. s — 
(i + j) < z, or s < 2i + j. Then, at least one (2, 1, l(-portion is always produced, 
and hence, for an (s, z, /(-portion S such that s > i + j > 3, i > j and s < 2i + j, 
we define 



z/p(s, z,/( = 1 if s > z -b / > 3, z > / and s < 2z -b /. (4( 

For the case of z < /, we define 



z/p(s,z,/( = tpp{s,j,i) a s> i + j >3 and i < j. 



( 5 ( 
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3.2 Combining with Dummy Cards 

In this subsection, we explain the second improvement; we introduce an oper- 
ation called “combining with dummy cards,” whereby Alice and Bob can effi- 
ciently utilize “unused” portions. 

We first explain an unused portion. Consider the case where Alice and Bob 
obtain a terminal collection Ct = {S\, 82 , - ■ ■ , Sm} when the transformation pro- 
tocol terminates. There is no pair of portions Sg and Si in Ct with Sg = > 3. If 

all the m portions in Ct are (2, 1, l)-portions, then Alice and Bob share an m-bit 
secret key, and hence there is no “unused” portion. However, if Ct contains an 
(s, 1, l)-portion with s > 3, then such a portion is not used to share a secret key, 
and hence it is unused. 

In order to utilize unused portions, we need to combine two portions of 
different sizes. For this purpose, we add “dummy” cards to the smaller portion. 
For example, consider a (6, 1, l)-portion and a (5, 1, l)-portion S 2 - We add 
one dummy card x to the portion 82 so that the resulting portion C /2 = S '2 U {a;} 
has the same size as S\. The dummy card x is chosen not in U S' 2 . Alice 
and Bob regard the dummy card x as Eve’s card. Then, we apply to Si and U 2 
a combining transformation described in Section 2. Let U' be the new portion 
acquired by combining. If U' has the dummy card x, then remove it from U' . 
That is, let S' = U' \ {x}, which is acquired as a new portion. In this way, 
one can combine two portions of different sizes. We thus obtain the following 
operation, called combining with dummy cards. 

Combining with dummy cards: An (si, 1, l)-portion Si and an (s 2 ,l,l)- 
portion S 2 with si > S 2 > 3 can be combined so that a new portion S' will be 
acquired, as follows. 

1. Let D be any set of dummy cards such that \D\ = S\ — S 2 and Dn(SiUS 2 ) = 
0. All the dummy cards in D are added to S 2 , that is, let U\ = S\ and 
C /2 = S 2 U D. Note that D is an empty set if si = S 2 . 

2. Alice randomly chooses an integer p G {1, 2}. Let q = 3 — p. 

3. Alice constructs and announces a set T consisting of her card in Up, [si/3j — 1 
cards randomly chosen from Up that are not hers, and [si/3j cards randomly 
chosen from Ug that are not hers. 

4. Bob announces how many cards he has in T. 

(a) If Bob has no cards in T, then Alice announces the set difference Uq\T 
and let U' = Uq \ T. 

(b) If Bob has exactly one card in T, then let U' = T. 

(c) If Bob has two cards in T, then Alice announces Up HT and let U' = 

C/p nr. 

5 . If U' has dummy cards, then remove them from U' , i.e. let S' = U'\D. Alice 
and Bob acquire S' as a new portion. 

When an (si, 1, l)-portion ^i and an (s 2 , 1, l)-portion S 2 such that si > 
S 2 are combined with dummy cards, the acquired portion S' has size at most 
[2si/3]; in particular, if Alice chooses p = 2 in step 2 and Bob has no card in T 
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announced by Alice, then the acquired portion S' = {Ui\T)\D = S'i\T contains 
no dummy card, and hence |S"| = |"2si/3]. Thus, in the “worst” combining, the 
acquired portion S' always has the size of exactly |"2si/3] . 

Since we have not defined tpp for a partial non-splittable portion, i.e. an 
(s, i, j)-portion with s > i + j = 2, we complete the definition of ipp in the 
remainder of this subsection. That is, we define tpp{s, 1, 1) for s > 3. Note that, 
by combining, two (3, 1, l)-portions become a (2, 1, l)-portion which yields a one- 
bit secret key, and that two (s, 1, l)-portions with s > 4 become a ([2s/3] , 1, 1)- 
portion in the “worst” case. Thus, we recursively define 

J V'p(3,l,l) = 1/2; and , . 

lV’p(s, 1,1) = 5V'p(r2s/3],l,l)if s>4. 

Notice that i/p(s, 1,1) is monotonically decreasing in s for all integers s > 3. 
We have thus completed the definition of ipp. In our transformation protocol 
whose full description will appear in the succeeding subsection, we combine 
an (si, 1, l)-portion S\ and an (s 2 , 1, l)-portion S 2 with dummy cards only if 
ipp{Si) = 'ipp{S 2 }', otherwise, the “partially potential” may decrease; for exam- 
ple, if a (3, 1, l)-portion and a (4, 1, l)-portion S 2 were combined, then a 
(3, 1, l)-portion S' would be obtained in the “worst” case, and hence the “par- 
tially potential” decreases by 1/2^. 

3.3 Our Protocol and Results 

We generalize a key set protocol [2, 5] to a “multiple key sets protocol,” whose 
definition is omitted in this extended abstract due to page limitation. As ex- 
plained in Sections 3.1 and 3.2, we modify the transformation protocol and 
obtain the following improved transformation protocol. Given a random deal S, 
the improved transformation protocol proceeds as follows. 

Improved transformation protocol: 

1. If the signature 7 = (a, 6; e) of 6 satisfies 0 < e < max{a, 5}, then the 
multiple key sets protocol is executed so that Alice and Bob share at least 
a minjo, b, [(a -I- 6 — e) /2J }-bit secret key, and the improved transformation 
protocol terminates. If e = 0 or e > maxja, b}, then go to step 2. 

2. The initial collection is C = Cq = {^}- 

3. Splitting is repeated as long as C contains a partial splittable portion, i.e. an 
(s, i, j)-portion with s > z -I- j > 3: choose such a portion S' in C according 
to any prearranged rule, remove S from C, apply a splitting transformation 
to S, and add all the new acquired portions to C. 

4. The operation of the combining with dummy cards is repeated as long as 
there are two portions Si and S 2 in C such that Si is an (si, 1, l)-portion, 
S 2 is an (s 2 , 1, l)-portion, si > S 2 > 3, and z/p(Si) = z/p(S 2 ): choose such 
two portions Si and S 2 in C according to any prearranged rule, remove both 
Si and S 2 from C, apply a combining transformation with dummy cards to 
Si and S 2 , and add the new acquired portion to C. 
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5. Alice and Bob share a '0c(<S')-bit secret key from the terminal collec- 

tion C = Ct, and the improved transformation protocol terminates. 

In step 1, we use the multiple key sets protocol because this protocol is effective 
when a signature 7 = (a, b; e) satisfies 0 < e < max{a, b}. 

We now give the definition of our potential function which maps a collection 
C to a nonnegative real number as follows: 



V>(C) = ^V'c(5) + 

Sec 



.Sec 



(7) 



(We takes the floor in the second term of the right hand side, because a set of 
partial portions is transformed into several ( 2 , 1 , l)-portions, each of which yields 
a one-bit secret key.) We write tp{a + b + e,a,b) instead of tp{Co) if Cq = {^} is 
a singleton collection relative to a deal S having a signature 7 = (a, b; e). 

Considering the case where the multiple key sets protocol runs, we define a 
function b; e) for a signature 7 = (a, b; e) with a,b> 1 , as follows: 



J min{a, 6, [(a -1-5 — e)/2 j}if0<e< max{a, 5}; 
( ■0(a -I- 5 -I- e, a, 5) otherwise. 



(8) 



We have the following Theorem 2 as our main result. We omit a proof of 
Theorem 2 due to the page limitation. 



Theorem 2 Let n = log to for an integer m > 2, and let 7 = (a,b;e) be a 
signature with a,b > 1. Then the improved transformation protocol establishes 
an n-bit secret key exchange for 7 if and only z/'f'(a, 5; e) > n. 



4 Conclusions 

The transformation protocol can efficiently make players share a perfect secret 
key using a random deal of cards. A sufficient condition for the transformation 
protocol to establish an n-bit secret key exchange was known. However, it has 
been an open problem to obtain a necessary and sufficient condition. This pa- 
per improves the transformation protocol, and gives a necessary and sufficient 
condition for the improved transformation protocol to establish an n-bit secret 
key exchange as in Theorem 2. Our improved transformation protocol is entirely 
superior to the original transformation protocol. Thus, Theorem 2 closes the 
open problem above. 

Fischer and Wright [3, 5] proposed a method for reducing the problem of a 
multiparty n-bit secret key exchange to the problem of a 2 -party n-bit secret 
key exchange. Hence, using this method, one can easily extend our protocol so 
that it performs a /c-party n-bit secret key exchange with k > i. 

This paper addresses only the transformation protocol. Therefore, it still re- 
mains open to obtain a necessary and sufficient condition for any (not necessarily 
transformation) protocol to establish an n-bit secret key exchange for a signature 
7 [4, 14]. 
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1 Introduction and Problem Statement 

Traditional selection and assignment problems analyze how to select and as- 
sign the suppliers to meet demands within minimum total costs. The maximum 
capacity and the fixed selection costs are two major restrictions imposed on 
suppliers, and extensively studied in literature. 

In practice, another constraint, named as Minimum Quantity Commitment 
(MQC), needs to be satisfied by each selected supplier. It is stipulated for the 
global transportation by the US Federal Maritime Commission. According to 
the MQC, when assigning containers for shipments to US, the international 
companies, including the Royal Philips Electronics Company, must insure each 
selected shipping agent to have a certain minimum quantity of containers. This 
increases difficulties for them to seek the optimal schedule with minimum total 
costs. The MQC constraint has been seldom studied before, which motivates us 
to present its hardness results for the selection and assignment problems. 

Before formulating the new MQC constraint, we review some traditional 
problems first. Let J = {1, ...,n} denote the set of transportation requests, each 
requiring dj {j G J) containers to be shipped from a source to a destination. 
Let / = {1, ..., m} denote the set of shipping agents, and each agent z G / bids a 
transportation cost, denoted by Cj^-, for shipping a container in request j. The 
basic selection and assignment problem can be defined to select a subset A C I 
of shipping agents, and to assign every i G A a, proper number of containers, say 
Zij, for each request j G J, so that the following total cost is minimized: 

( 1 ) 

ieAjeJ 

while the demand constraint is satisfied: 

''^^Zij = dj, (2) 

i^A 

This basic problem can be easily solved by a greedy approach, that is to assign 
each request for container shipment to the lowest bidder or agent. However, it 
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becomes more difficult, if the maximum capacity, Si, for each agent z G J is 
considered, which leads the capacity constraint: 

(3) 

ieJ 

The capacitated variant above is known as the transportation problem [1], 
which can not be solved by the simple greedy method, but can be transformed 
to a minimum cost flow problem and is still polynomial solvable [1]. 

Besides the transportation cost, the flxed selection cost fi for selecting each 
agent z G A is often considered in literature as well. It extends the objective 
functions to be: 



( 4 ) 

iGA iGAJGJ 

The flxed selection cost makes the problem intractable. No matter whether 
the capacity constraint (3) is considered (or not), the selection and assignment 
problem with flxed selection costs, called uncapacitated (or capacitated resp.) 
facility location problem [2,6], is unary TVP-hard [5]. However, being part of the 
objective function, the flxed selection cost does not affect the search for feasible 
solutions. As shown in Section 3.2, we can easily generate feasible solutions to 
satisfy the demand constraint (2) and/or the maximum capacity constraint (3). 

However, finding feasible solutions might become intractable if the MQC 
constraint is considered. We generalize the MQC constraint to w regions. Let 
R = {1, ...,w} denote the set of regions with MQC constraint. For each region 
r G R, its minimum quantity is br and the set of transportation requests with 
destinations in r is Jr J ■ It is reasonable to assume that Jp and Jq are disjoint 
for any two different regions p and q in R. Therefore, the MQC constraint can 
be modeled as: 



= 0, or ^ Zij > br (5) 

ieJr jeA 

Hence, the selection and assignment problem with the MQC constraint is to 
minimize the total cost (4) subjected to the demand constraint (2), the maximum 
capacity constraint (3) and the new MQC constraint (5). Its solution, represented 
by the selected agent set A and the assignments {zij}, is feasible if and only if it 
satisfies the above three constraints, and is optimal if and only if it is a feasible 
solution and has the minimum total cost (4). The problem to And an optimal 
solution is the optimization variant of this problem. Similarly, the problem of 
finding a feasible solution is the feasibility variant of this problem. 

Our paper presents the computational complexity results of finding an opti- 
mal (or feasible) solution for the selection and assignment problem with mini- 
mum quantity commitments. We study various special cases with different spec- 
ifications of maximum capacity Si, flxed selection cost fi, and number of MQC 
regions w. For each special case, we will either give a polynomial algorithm, or 
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prove its binary A/"7^-hardness [5] with a pseudo-polynomial algorithm, or prove 
its unary A/’T^-hardness [5]. Our result reveals how the new MQC constraint 
affects the hardness of the selection and assignment problem. Due to space lim- 
itation, details of some proofs are left out here, but can be found in the long 
paper [8]. 

To make consistent the notations throughout the paper, let Jq be the set 
of transportation requests not subjected to the MQC constraint, and fix its 
minimum quantity bo to be zero. Let U = i?U{0} denote the universal set, where 
region 0 is the complement of all the regions with MQC restriction. Let D = 
^j^jdj denote the total demand of transportation requests and S = 
denotes the total maximum capacity of shipping agents. These two totals often 
appear in this paper. 



2 Classifications 

To help with our classification, a four-field notation Obj{a, f3,j) is proposed 
here, where Obj defines the objective (optimization or feasibility), a indicates 
the number of regions with the MQC constraint, j3 gives the maximum capacity 
for agents, and 7 describes the fixed selection cost. 

The field Obj is the objective function. It has two possibilities: 

~ Optimization', the objective is to find the optimal solution; 

~ Feasibility: the objective is relaxed, we only want to find a feasible solution. 

The field a indicates the number of regions subjected to MQC constraint, a 
has the following possibilities: 

— w: the MQC region number is arbitrary and specified as a part of the prob- 
lem’s input; 

— w = X: the MQC region number is a constant, and is excluded from the 
problem’s input; 

— w = 0, 1, 2, or etc.: the MQC region number is a given integer. For example, 
if a is equal to w = 1, only one region has the MQC constraint. 

The field /? describes the maximum capacities of agents. /? can take the 
following values: 

— sf. the maximum capacities of agents are arbitrary; 

— Si = 00 : the maximum capacities of agents are all infinity, i.e. the maximum 
capacity restriction (3) is relaxed. 

Lastly, the field 7 describes the fixed selection costs of agents, 7 can be: 

— fi'. the fixed selection costs of agents are arbitrary; 

— /i = 0 : the fixed selection costs of agents are all zero, i.e. the total cost of a 
solution is relaxed from (4) to the total transportation cost (1) only. 
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3 Computational Complexity 

The main complexity results are summarized in Table 1. From the results, we 
can derive a complete map of the computational complexity results for all special 
cases by their reduction relations shown in Section 2. In Table 1, we use Unary 
AfT^-hard and Binary AfT^-hard, defined in [5], to express that a problem is 
AfT^-hard with respect to a unary and binary encoding of the data respectively. 
Along with each result, we provide a reference where its proof can be found. The 
entries in the 7 column of the the Feasibility rows imply that those results 
hold true for any specification of the fixed selection costs {fi or /^ = 0), because 
the total cost (4) can be neglected when a feasible solution is considered. The 
complexity results show that 

1. finding an optimal solution is unary A/T^-hard unless both MQC restrictions 
and fixed selection costs are relaxed, i.e., w = 0 and fi = 0, and 

2. finding only a feasible solution is at least binary A/”7^-hard unless at most one 
region holds the MQC rule or the maximum capacity is relaxed, i.e., w < 1 
or Si = 00. 



Table 1. The Complexity Results 



Objective 


a 


p 


7 


Complexity Results 


Optimization 


W = 0 


Si = 00 


fi = 0 


0{nm) Thm 1 


w = 0 


Si 


.fi = 0 


0{{n + mpiog{n + m)) [7J 


w = 0 


Si = 00 


.fi 


Unary AfP-hard [4J 


w = 1 


Si = 00 


fi = 0 


Unary A/’A’-hard Thm 4 


Feasibility 


w 


Si = 00 


* 


0{nm) Thm 5 


w = 0 


Si 


* 


0{nm) Thm 6 


w = 1 


Si 


* 


0(nm) Thm 9 


w = 2 


Si 


* 


Binary AfP-hard Thm 8 

OijnS^ -1- nm) Cor 1 


w = X 


Si 


* 


Binary MP-hard Cor 2 

0{mS^^'^^ + nm) Thm 7 


w 


Si 


* 


Unary NP-hard Thm 10 



3.1 Complexity of Finding an Optimal Solution 

We begin with the case Optimization{w = 0,Si = 00 , fi = 0), which can be 
reduced to all other optimization cases by the reduction relations shown in Sec- 
tion 2. For this case, both the maximum capacity constraint (3) and the MQC 
constraint (5) are relaxed. The absence of the fixed selection cost allows us to 
select all the agents without penalty. Therefore, we can assign each transporta- 
tion request greedily to the agent with the lowest bid for that request. It is easy 
to see the total time complexity is 0{mn), leading the following result. 

Theorem 1. Optimization{w = 0 ,Si = 00 , fi = 0) can he solved in 0{nm) 
time. 
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The next case Optimization{w = 0, Si, fi = 0) is equivalent to the trans- 
portation problem [1]. As we mentioned in Section 1, it can be solved by the 
minimum cost flow algorithm presented in [7]: 

Theorem 2. The minimum cost flow algorithm generates an optimal solution 
for the case Optimization{w = 0, Sj, fl = 0) in 0((n -I- mfllog{n + m)) time. 

We now consider the case Optimization{w = 0, Sj = oo, fl) that has flxed 
selection costs fl without the maximum capacity constraint and the MQC con- 
straint. Since this case is equivalent to the uncapacitated facility location prob- 
lem [4], a well-known unary AfV-hard problem, we have the following result: 

Theorem 3. Finding an optimal solution for the case Optimization{w = 0, 
Si = oo, fi) is unary AfV-hard. 

Next we consider the case Optimization{w = l,Si = oo, fi = 0) that has one 
the MQC constraint for one region but relaxes the maximum capacity constraint 
and ignores the flxed selection costs. It can be proved to be unary A/”7^-hard by 
a reduction from Cover By 3-Sets (X3C) problem [5]. (See [ 8 ].) 

Theorem 4. Finding an optimal solution for the case Optimization{w = 1, 
Si = oo, fi = 0) is unary NV-hard. 

3.2 Complexity of Finding a Feasible Solution 

Feasibility cases are studied in this part. It needs to consider the constraints 
(2) (3) (5) but not the total cost (4). So, let the held 7 be as the flxed 
selection cost does not change the complexity results. Therefore, we can select 
all the agents safely by assuming A = {1,2, and concentrate our study 

on how to obtain the assignment Zij for z G A and j G J. 

For uncapacitated cases with Si = 00 , let us look at the most general case 
Feasibility{w, Si = 00 ,*). Its feasible assignment can be constructed easily by 
allocating all the transportation requests with full demands to a certain agent. 
The process is described in Algorithm 1, whose correctness can be easily seen. 

Theorem 5. Algorithm 1 generates a feasible solution for the case Feasibility 
{w, Si = 00 , *) in 0(nm) time. 



Algorithm 1 Solving Feasibility{w,Si = 00 ,*) 

1: if for some region r G K, 6r exceed the total demands D of requests in region r then 
2: Output “No Feasible Solution”; 

3l else 

4: Initially, Zij •«— 0 for all i G I and j G J; 

5l Select all the shipping agents: A •«— {1, 2, m}; 

6l for all the transportation request j G J do 

7’. Assign full demands of request j to the agent 1: zij •«— dj\ 

end for 

9l Output the selected set A and assignments zi^j for all i ^ A and j G J; 

10: end if 



For the capacitated case with (3 = Si^ different specifications of a will be 
studied. Prior to that, the following necessary condition can be easily observed. 
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Lemma 1. For the capacitated case Feasibility{a, Si,*) with any specification 
of a, if a feasible solution exists, then the sum of the maximum capacity for all 
agent must be at least as large as the total demand, i.e. S > D. 

Consider the case Feasibility{w = 0, Si, *) which relaxes the MQC constraint. 
We can obtain its feasible solution efficiently as follows. 

Theorem 6. F easibility{w = 0,Si,*) can be solved in 0{nm) time. 

Proof. Maintain two lists. One contains all the unsatisfied requests and the other 
stores all the available agents. Let u denote the first unsatisfied request with 
remaining demand > 0, and p the first available agent with remaining capacity 
Sp > 0. Accordingly, if Sp > d'p, we can assign all the remaining demands, d^, 
to the agent, p, decrease the remaining capacity, s^, and remove the request, u, 
from the request list; otherwise, we only assign demands of u to p, decrease 
the d^, and remove p from the agent list. The above process is continued until 
all the requests have been satisfied. It is easy to see that if the total agents’ 
maximum capacity is at least as large as the total demands {S > D), the above 
process will stop with a feasible solution; otherwise no feasible solution exists 
by Lemma 1. Note that for each loop, either a request or an agent is removed 
and that there are n requests and m agents. Therefore, the time complexity in 
the loop is 0(n + m). Considering that initializations consume 0{nm) time, the 
total time complexity is 0{nm). □ 

In the rest of this section, the MQC constraint is studied for feasibility cases, 
by starting with F easibility{w = A, s^,*), where the MQC region number w 
equals a fixed integer A. 

For each agent i G A and region r G U, let 

ti,r — ^ ^ ^i,j ( 6 ) 

jGJr 

denote the total regional shipment assigned to a selected agent i among all 
transportation requests in the region r G R. (Recall that the universal set is 
U = RU {0}, where the region 0 holds request set Jo not constrained by MQC 
constraint by fixing the minimum quantity = 0-) 

Our basic idea to solve the case Feasibility {w = A, Sj,*) consists of the 
following stages: 

— Stage 1: Select all the agents, i.e. A= {1,2, ..., m}; 

— Stage 2: For each agent i G A and region r G U, generate the total regional 

shipment 

— Stage 3: Based on A and ti^r, obtain the assignment Zij for i G A and j G J. 

As discussed before, all agents are selected in Stage 1 for the feasibility case. 
To see what constraints the total regional shipment needed to satisfy, let us 
examine any feasible assignment Zij for Feasibility {w = A, Sj,*). the demand 
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constraint (2) implies that the total shipments to region r should be the same 
as its total demands for r €U, i.e. 

dj. (7) 

ieA jeJr 

The capacity constraint (3) implies that the total shipments to all regions by 
one agent i should not exceed its maximum capacity Si for i G A, i.e. 

U,r < Si. (8) 

rGU 

And the MQC constraint (5) implies that for each agent i G A and each MQC 
region r G R, the total regional shipment ti^r must satisfy: 

ti^r = 0 or ti^r > br- (9) 

Then, a necessary condition of the feasibility follows, for the case Feasibility (w = 

A, Si,*). 

Lemma 2. If there exists a feasible assignment Zij for i G A and j G J for the 
case Feasibility{w = A, Si, *), we can have its total regional shipment ti^r by (6) 
to satisfy the constraints (7)-(9), where i G A and r gU . 

Moreover, this condition is also sufficient. We show it by proving the following 
result, which partially solves the case Feasibility{w = A, Si, *). 

Lemma 3. Algorithm 2 generates a feasible assignment Zij with i G A and 
j G J for the case F easibility{w = A, Si,*) in 0{nm) time, if the given total 
regional shipment ti^r for i G A and r GU such that the constraints (7)-(9) are 
satisfied. 

Proof Suppose that the total regional shipment ti^r satisfies constraints (7)-(9) 
for z G A and r GU. Note that the regional request sets Jq, Ji, ..., J\ forms a 
disjoint partition of the whole set J. We can obtain feasible assignments Zij of 
all the agents z, for requests j in Jq, Ji, ..., J\ respectively. 

For each request set Jr of a region r, consider the following instance of the 
case Feasibility{w = 0, s^, *). The request set becomes Jr holding requests only 
in region r, and the demand of request j is still dj. The agent set is A, and the 
maximum capacity of agent z changes to U^r- 

In this new F easibility{w = 0,Si,*) instance, by the constraint (7), we can 
apply Theorem 6 to generate feasible assignments for z G A and j G Jr. 
Because the demand is unchanged, also satisfies the original demand con- 
straint (2). Since ti^r > br by the constraint (9), the original MQC rule (5) is 
satisfied for region r. 

Therefore, we solve those Ad- 1 instances of the case Feasibility{w = 0, Si, *) 
for Jq, Ji, ..., J\ one by one. Afterwards, assignments Zij can be obtained for 
z G A and j G J, such that both the original demand constraint (2) and MQC 
rules (5) are satisfied. Recall that for each region r G U the relation (6) holds 
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Algorithm 2 Solving Feasibility {w = X,Si, *) given Total Regional Shipments 



1: Input: The selected set A — {1,2,..., m}, and the total regional shipment ti^r (i e A and r ei U) 
which satisfies constraints (7)— (9); 

2 : Initially, set zij •«— 0 for i G I and j G J', 
for all region r ^ U do 

4 : Construct an instance of Feasibility{w — 0, Sj,*), where the request set is Jr with each 

request j G Jr having demand dj, and the agent set is A with each agent i ^ A holding 
maximum capacity 

5 1 Solve this instance by Theorem 6 to obtain a feasible assignment Zij for i ^ A and j G Jr', 

end for 

7'. Output assignments Zij for i ^ A and j G J', 



true, by (8), we know that Zij satisfies its original maximum capacity con- 
straint (3) for t e A and j G J. 

A formal statement of the above process is given in Algorithm 2. Note that 
Feasibility{w = 0, Si, *) has been solved for \U\ times, each consuming 0{\Jr\m) 
time for 0 < r < A. So totally, the time complexity of Algorithm 2 is 0{nm), by 
I'AsI + |Ti I -h ... -h I Ja| = n. □ 

To completely solve the case Feasibility {w = A, s^,*), we need to obtain 
the total regional shipment ti^r to satisfy constraints (7)-(9) for i G A and 
r G U, before we apply the Algorithm 2. This can be achieved by a dynamic 
programming algorithm proposed as follows. 

Let I{p) = {1, ...,p} denote the set of the first p agents in A (1 < p < m). 
Let Dr = ^3 denote the total demands for requests in region r G C/. To 

satisfy demands in the constraint (7), we trace the partial regional shipments 
transported by those agents i G I(p) for region r, where p is from 1 to m. In 
other words, for each p, let Vr = denote the partial regional shipments 

for region r. So the states of the dynamic programming algorithm are {p,~u), 
where the vector it = {vo,vi, ...jVx) and Vr < Dr for all r € U. The algorithm 
recursively computes values of G(p, if), denoting the true value of the statement: 
there exists integer values of ti^r for i G I{p) and r G U, such that 

ti^r = Vr, for all r G {/; (10) 

iG/(p) 

ti,r < Si, for all i G /(p); (11) 

rGU 

ti r = 0 or ti^r > br, for all i G I{p) and r gU. (12) 

It is easy to see that there exists ti^r to satisfy these constraints (7)-(9) for z G A 
and r G [/, if and only if G{m, {Dq, Di, D\)) is true. So our objective is to 
obtain the value of G{m, {Dq, Di, D\)), which can be computed recursively 
as follows. 

Initially, set G(0, it) true if if = (0, 0, ..., 0); otherwise set it false. 

In each iteration p = l,2,...,m, consider the vector it with field Vr < Dr 
for r G U. To obtain the current value of G{p,lt), we need to enumerate all 
the possible values of tp r for r G U. Note that tp^r can not exceed Vr by (10). 
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Let S' be the set of vectors t = with tp^r < Ur for r G C/ 

and satisfying (11) and (12). Accordingly by (10), we can obtain the following 
recursion: the value of G{p, ~u) is true if and only if there exists a vector t £ 'I' 
such that the value of G{p — l,~u — t) is true. 

After these m iterations, we can examine the value of G{m, {Dq, Di , ..., Dx)). 
If it is true, then the feasible values of satisfying constraints (7)-(9) for z G A 
and r G U, can be derived based on the previous process of recurrent computa- 
tions; otherwise no feasible ti^r exists. Whenever a feasible ti^r is obtained, we can 
solve Feasibility{w = A, s,, *) by Algorithm 2. By a straightforward estimation 
of its time complexity, we can derive the following theorem. 

Theorem 7. Feasibility{w = A, Sj, *) can be solved in + nm) time. 

So far, a pseudo-polynomial algorithm has been designed for F easibility{w = 
A, Si, *). By specifying w = 2, we have: 

Corollary 1. Finding a feasible solution for the case Feasibility{w = 2, Si, *) 
can be solved in 0{mS^ + nm) time. 

On the other hand, the case F easibility{w = 2,Sj,*) is binary AfT^-hard, 
indicating that polynomial algorithm is unlikely to exist. It can be proved by a 
reduction from the Partition problem [5]. (See [8].) 

Theorem 8. Feasibility{w = 2, Si,F) is binary If V -hard. 

The complexity result above can be applied to F easibility{w = A, Si,*), by 
the reduction from w = 2 to w = A in the field a shown in Section 2. This 
derived the following corollary. 

Corollary 2. Feasibility{w = X, Si,*) is binary If V -hard. 

We have seen that the capacitated feasibility case of w > 2 is binary IfV- 
hard and has a pseudo-polynomial algorithm. However, when w = 1, an efficient 
polynomial algorithm exists for Feasibility{w = l,Si,*) where R = {1} and 
U = {0, 1}. Recall that the polynomial Algorithm 2 can be still applied for the 
case Feasibility{w = l,Si,*) to obtain feasible assignments Zij for z G A and 
j G J, if given the total regional shipments for z G A and r £ U . So, we only 
need to consider how to obtain ti^r efficiently for this case as follows. 

Suppose S > D hy Lemma 1. Assume that the agents are ordered non- 
decreasingly on their maximum capacities, i.e. Si > S 2 > > Sm- Moreover, 

let Cl indicate the number of agents, which can serve requests in region 1 whose 
capacities are at least bi. 

If the total regional shipment has been obtained for i £ A and r £ U with 
satisfaction of constraints (7)-(9), it is easy to see that that the YTi=i ^ 
must be satisfied, where c = min(ci, [I?i/6iJ). 

To see this condition is also sufficient, we will show that if 
we can safely choose Ai = {1, ..., c} to serve the requests in region 1 and obtain 
ti^r as follows. Firstly, excluding quantity b\ reserved for the minimum quantity 
of region 1, the remnant capacity of agent z can be denoted by s' = — b\ for 

1 < z < c and the total remnant demands in region 1 is reduced to = Di — cb\. 
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By have XieAi Bet t such an index that c > r > 1 

and XI=i ^ Then the first r agents can satisfy the remnant 

demands D[, since we can assign s' to each agent i where i < t, and assign the 
rest D[ — Xr=i^ te> agent t. Furthermore, including the reserved quantity bi, 
we can satisfy all the demands D\ by assigning agents in A\ with = Si if 
z < T — 1, and = Di~ (c — r)6i — Xl=i^ = 6i if r < z < c. It is easy 

to verify that both the demand constraint (7) and the MQC constraints (9) are 
satisfied for region 1. On the other hand, for region 0, note that the total demands 
is Do = D — Di and the total remnant capacity of all agents is S'o = -S' — -Di- By 
S > D, we have So > Dq. Using the same way discussed above for satisfying the 
remnant demands D[ in region 1, we can obtain the regional shipment for 
z G A to satisfy constraints (7) (9) for region 0. Moreover, it is easy to see that 
the capacity constraint (8) for where z G A and r G C/, is satisfied by all. 

As ti^r has been obtained for i € A and r G C/, we can employ Algorithm 2 
to solve the feasible Zij for z G A and j G J. Because choosing agents with c 
largest capacities consumes 0{m) time [3], the main process in Algorithm 2 cost 
0{n + m) time at most, except the initializations which consumes 0{nm) time. 
So, its total time complexity is 0{nm). This leads the following theorem. 
Theorem 9. F easibility{w = l,Si,*) can be solved in 0{nm) time. 

Lastly, the general capacitated feasibility case with arbitrary w can be proved 
to be unary AfT^-hard by a reduction from 3-Partition problem [5]. (See [8].) 
Theorem 10. Finding a feasible solution for the case Feasibility{w, Si,*) is 
unary AfV-hard. 
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Abstract. In this paper, we present (1) our implementation of a fast 
approximation multicommodity flow algorithm, and (2) our implementa- 
tion of the first provably good approximation algorithm for the minimum 
normalized cut problem. (2) serves as an important application of (1). 
Our experimental results show that the output quality of our approach 
compares favorably against some previous approximation multicommod- 
ity flow implementation and eigenvalue/eigenvector based normalized cut 
implementation. We also show comparisons on the execution times and 
analyze the underlying reasons. 



1 Introduction 

The multicommodity flow problem [1] is concerned with computing the largest 
rate at which flow commodities can be routed from multiple sources to multiple 
sinks in a graph without violating any given edge capacity constraints. This 
problem is essential to solving many optimization problems, and finds substantial 
applications in the area of approximation algorithms. 

The goals of our work are to obtain a fast implementation of a good quality 
approximation algorithm for the multicommodity flow problem, and to use this 
implementation as a main tool to study some important applications. For exam- 
ple, the maximum concurrent flow problem (MCFP) [19] that we study can be 
used to solve the pairwise data clustering problem [25] based on a novel criterion 
called normalized cut [24]. 
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The normalized cut criterion measures both the total dissimilarity among 
different components of a partition as well as the total similarity within the 
same components. Shi and Malik [24] showed that the minimum normalized cut 
criterion is capable of delivering impressive image segmentation results. Since 
the minimum normalized cut problem is NP-complete [24] , they gave a heuristic 
algorithm for solving this problem by computing the eigenvalues and eigenvec- 
tors of the incident Laplacian matrix of the corresponding graph. However, no 
provable quality is known for their approach. 

Wu et al. [25] proposed the first provably good polynomial time combinatorial 
approximation algorithm for the minimum normalized cut problem, by modeling 
it approximately as a linear program (LP) dual of the maximum concurrent flow 
problem [19] and applying the sphere-growing technique [8, 11] . For an undirected 
weighted graph of n vertices and m edges and any fixed value e > 0, they pre- 
sented a (4(l-|-e) -|-o(l)) ln(n)-approximation algorithm in time for the 

minimum normalized cut problem, where 0{f{m)) denotes m). 

The sphere-growing procedure essentially runs Dijkstra’s shortest path algo- 
rithm, in 0{nlogn + m) time [7]. The approach in [25] also improved the ap- 
proximation ratio of the algorithm for computing the sparsest cut [11] by a factor 
of nearly 4. Note that the time for solving the linear program (LP) dual of the 
concurrent flow problem dominates the overall running time of the algorithm 
[25]. Previous experimental results have suggested that concurrent flow based 
approximation techniques can yield much faster running time. 

Given a graph G = (V, E) with non-negative edge capacities u(e) and k > 
1 commodities, such that Si and U are the source and sink for commodity i 
associated with a demand d{i), the maximum concurrent flow problem (MCFP) 
[19] seeks the largest possible rate A such that there is a multicommodity flow 
that routes A • d{i) units of commodity i simultaneously in G for all i. Note that 
in Wu et al.’s approximation minimum normalized cut algorithm, it needs to 
solve a product multicommodity flow problem (PMFP), which is a special MCFP 
with the demand of flow between every pair of vertices u and v being the product 
of u and v’s weights. The MCFP is the dual of the LP relaxation of the sparsest 
cut problem [11], and can be solved optimally in polynomial time using LP 
techniques. However, in practice, the LP based approach may not be practically 
efficient for large size normalized cut problems. 

For many applications, it is more important to solve the MCFP approxi- 
mately in a fast manner than to solve it optimally. Developing fast approxima- 
tion algorithms for the MCFP to produce solutions that are provably close to 
the optimal one has attracted a lot of attention. A series of papers provides fully 
polynomial time approximation scheme (FPTAS) for the problem, based on Lan- 
grangian relaxation and LP decomposition techniques (e.g., [12,16-18,21,22]). 
Building on the framework of [9, 10], Karakostas [15] gave a faster FPTAS for 
the MCFP. This algorithm (to be described in Section 2.3) eliminates the run- 
ning time dependence on k, and runs in time, improving the previous 

best 0{e~^m{m + ky) time MCFP algorithm [9]. To our knowledge, Karakostas’ 
algorithm [15] has the so far best running time. 

In this paper, we report the following work. 
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— Our implementation of the approximation algorithm in [15] for the MCFP. 

— Our implementation of the approximation minimum normalized cut algo- 
rithm in [25]. This uses our implementation of [15] as the main tool, and 
serves as an important application of our implementation of the MCFP. 

— A study of the experimental behaviors of these two algorithms [15,25]. Our 
experimental data show that the practical performance of the algorithms is 
much better than their theoretical time bounds. 

Implementing Karakostas’ MCFP algorithm is a nontrivial effort since the 
algorithm is quite sophisticated. To improve the running time in practice, we 
must modify certain details of the algorithm [15], add and fine-tune some scale 
parameters to control the computations, and apply a number of strategies to 
reduce numerical errors. We describe our changes and the motivations behind 
them in Sections 3. We show and analyze our experimental results in Section 4. 



2 Preliminaries 



2.1 Minimum Normalized Cuts 



Given an undirected graph G = (V,E), with each edge e € E having a weight 
w(e) > 0, a cut C of G is a subset of edges whose removal partitions V into two 
disjoint complementary subsets A and A. The minimum normalized cut is a cut 
C whose normalized cost 



a{A) 



cut{A,A) ^ cut{A,A) 
assoc{A, V) assoc(A, V) 



( 1 ) 



is minimized, where cut{A, A) = X^eeC is the sum of edge weights of the 
cut G, and assoc{S,V) = t;)6£; mgS wev ^ subset S' of P is the 

total weight of edges between S and all vertices in G. Papadimitriou [24,27] 
has shown that computing a minimum normalized cut in G is an NP-complete 
problem. 



2.2 Wu et aids Combinatorial Algorithm 

Wu et al. [25] studied the minimum normalized cut problem on an undirected 
graph G = (P, E), with each edge e having a weight w{e) > 0. The weight w{u) 
of each vertex m in G is defined as 5 X)e=(« w)g£; ^(®)- ^ subset A C P, 

let w{A) denote the total weight of the vertices in A, and IP denote the total 
weight of all edges in E. Then TP is also equal to algorithm 

[25] first introduces a direct fractional relaxation of the minimum normalized 
cut problem as an LP dual of the product multicommodity flow problem [19]. 
This linear program assigns a non-negative edge length l{e) to each edge e G E, 
which will be used by the sphere-growing procedure (to be discussed later). To 
apply the linear fractional relaxation, a commodity is associated with every pair 
of vertices u and v in G, whose demand is set as w{u) • w{v). 
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We need to review some notation. For a subgraph G' = (F', E') of G, let 
N{v, r) = {u \ distG'{u, v) < r} denote a sphere of radius r in G' that grows 
from a vertex v € V', where distciu, v) is the u-to-v shortest path length in G' 
based on the edge length function l(-). Let E(v, r) denote the set of edges whose 
end vertices are both in N{v,r), C{v,r) denote the induced cut of N{v,r) (i.e., 
G{v,r) is the set of edges cutting N{v,r) from V — N{v,r)), and w{C{v,r)) 
denote X)eeC(i> r) ^(®)- volume of a sphere N{v,r), denoted by vol{v,r), 
defines the credits that are associated with the sphere in order to pay for the 
cost of the induced cut (see [25] for the precise definition). 

The basic idea of the sphere-growing process is to run Dijkstra’s single-source 
shortest path algorithm in G' from an arbitrary vertex v to produce a sphere 
N{v, r), such that the ratio i? of w{G{v, r)) over vol(v, r) is logarithmic in terms 
of n. The procedure starts at an arbitrary vertex v S V puts v into a sphere 
denoted by S, and finds a vertex x such that disto'iv^x) is the smallest of 
distG'{v,v') among all v' G V — S. It then checks whether w{G{v,r)) > a 
parameter cr times vol{v,r). If yes, then x is put into the sphere S, and r = 
distG'{v,x). The procedure repeatedly puts new vertices into S', until either the 
total weight w{G(v, r)) of the cut is no bigger than the criterion given above or no 
more vertex is left in V' — S, i.e., the termination condition of the sphere-growing 
procedure is i? < cr or y' = S. The value of cr is a key in determining the upper 
bound of the sphere radius r thus computed, and is chosen as -\/ln(nlnn -I- !)• 

(■\/2'\/ln(n In n -|- 1) -I- 1) in [25], which yields a better approximation ratio than 
the previous work. 

Wu et aids algorithm uses this sphere-growing procedure as a basic function, 
and generates a set of disjoint spheres in G. In all spheres thus produced, it has 
been proved [25] that either one can find a sphere which gives the cut that yields 
a (4 -|- o(l)) In n-approximate result, or can choose a vertex of G from which 
another sphere can be grown to generate the desired normalized cut (see [25] for 
more details). 

2.3 Maximum Concurrent Flows 

We now summarize Karakostas’ approximation algorithm [15] for solving the 
MCFP. As mentioned before, the MCFP can be expressed as a linear program. 
Let D{1) denote the dual objective function ^gU{e) ■ l{e), disti{l) denote the 
length of the shortest path from Si to ti in G based on the length function I, and 
a{l) = ^id{i) ■ disti{l). It has been observed [10] that minimizing D{1) under 
the dual constraints can also be viewed as computing the lengths l{e) for the 
edges e of G, such that D{l)/a(l) is minimized. Let [3 denote the minimum value 
oiD{l)/a{l). 

When (3 > 1, Karakostas [15] gave an time approximation algo- 

rithm that improves on [10]. Initially, for all edges e, l{e) = S/u{e), where i5 is a 
fixed value that is set to (1-|-A:e) ^ • ("^) ' • The algorithm proceeds in phases. 

In each phase, there are [S'] iterations, where S is the set of source vertices. In 
iteration j, consider all commodities ci, C 2 , . . . , that share the same source Sj. 
The objective in this iteration is: For all demands d{cq),q = l,2,...,r, route 
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d{cq) units of flow from sj to the corresponding sink of commodity Cq. This is 
done in a series of steps. In each step, the shortest paths from Sj to the sink of 
each commodity Cq is computed based on the current length function. Let c be 
the minimum capacity of the edges on the shortest paths thus computed for all 
commodities Cq. Then for each commodity Cq, an amount of flow that is equal to 
the smallest value of c and the remaining demand of Cq is sent from Sj along the 
shortest path for Cq. The amount of routed flow is deducted from the demand 
of Cq, and the edge lengths on these shortest paths are updated according to 

S p fc 

a length function l{e) = l{e){l + e ■ — ), where fc^ is the amount of 

flow routed for commodity Cq and is the shortest path from Sj to the sink of 
commodity Cq. The entire procedure terminates when the dual objective function 

D{1) = • Ke) > 1 - 

When /3 < 1, it has been shown in [10] that one can scale the ratio of 
capacities over demands, and use some binary search type methods to ensure 
1<P<2. 

3 Implementations 

We implemented the algorithms using programming languages C/CH — h on a 
Sun Blade 1000 Workstation with a 600 MHz UltraSPARC III Processor and 
512 Megabytes of memory. In this section, we describe some important imple- 
mentation details and our handling of the issues arising in our implementations. 

As mentioned before, Wu et al.’s algorithm works on undirected graphs, while 
Karakostas’ algorithm is designed for directed graphs. To make our implementa- 
tions compatible with both of these two algorithms, for each edge in the original 
undirected graph, we add 2 more vertices and 4 more edges to form an “equiva- 
lent” directed subgraph [1]. By carefully assigning the edge lengths and weights, 
we ensure the correctness of the flow results. 

The maximum concurrent flow algorithm in [15] proceeds in phases; each 
phase consists of iterations and each iteration performs some steps. When the 
whole algorithm terminates, the flow values have been scaled down to a solution 
that is very close to the optimal one without violating any edge capacities (i.e., a 
feasible and good approximation quality solution) . Our experiments have shown 
that on some problem instances for which the feasible, exact optimal solutions 
can be easily found (i.e., all commodity demands can be routed simultaneously 
without violating any edge capacities) , it is really not necessary to waste time to 
route the flow using multiple phases. In this case, we can actually And a feasible, 
exact optimal solution in one phase, reducing the running time dramatically. 

To achieve this goal and for the purpose of speeding up the program for all 
cases, we use an extra copy of the edge capacities of the graph. This copy of 
the edge capacities is modified as the algorithm proceeds while the original copy 
will not be changed. At the beginning of each iteration, the edge capacities in 
the extra copy are reset to the edge capacities of the original graph, and they 
are updated after routing the flow in each step. We also use a flag to indicate 
whether there is an edge whose flow already exceeds its capacity. As discussed 
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in Section 2, after the last step of iteration j, all demands of the commodities 
with the same source sj have been routed. Thus, after all iterations, if the flag 
has not been set, then a feasible and exact optimal flow solution has been found, 
and we can stop the algorithm. In this way, a feasible, exact optimal solution 
can be obtained after only 1 phase of [S'! iterations, where S is the number of 
different sources. 

Correspondingly, the edge lengths along the computed shortest paths are 



updated according to a function 1(e) = ?(e)(l+< 



r- eCZT> flow(e) 

■ " Clle) ). Where /^orc(e) 

denotes the total flow routing on edge e and Ucopy(e) denotes the remaining 
weight of e in the extra edge capacity copy. If Ucopy(e) of an edge e becomes 
zero, namely, the flow routing on this edge has used up all its capacity, then 



He -ef=P f^ow{e) 

we use another length function 1(e) = 2000 • ^(e)(l + e • — - — ). The 

purposes of using different length functions are to assign larger lengths (costs) 
to edges with less capacities left, and to encourage routing flows along other 
less congested edges in the subsequent steps. The scale-up factor reduces the 
possibility of routing flows on edges which are already “full” when some “unfull” 
paths still exist. 

Numerical errors occur frequently in our implementations. To deal with these 
problems, several strategies have been applied in our implementations to achieve 
higher reliability. For example, we use “double” data type, avoid/delay “inac- 
curate” computations such as square root, apply some static error estimates 
schemes, scale up small values if the results will not be affected, and etc.. 



4 Experimental Results 

We have run our programs on a variety of problems. The experiments were 
performed on the same Sun Blade 1000 Workstation. All graphs shown below 
are generated by a random graph generator written by ourselves in C-| — h. 



4.1 Dependence on e 

The maximum concurrent flow algorithm in [15] takes 0(e~^m^) time, where 
0(f(m)) denotes 0(f(m) m). Thus, the theoretical bound shows an in- 

verse polynomial dependence of the execution time on the parameter e for the 
approximation error. 

We ran our MCFP program on various random graphs with randomly placed 
commodities. Let n, m, and k denote the numbers of vertices, edges, and com- 
modities of an input undirected graph, respectively. For example, using a random 
graph with n = 40 vertices, m = 72 edges, and k = 20 commodities, we graphed 
the e values versus the total execution time (in seconds) in Figure 1. The upper 
curve represents the theoretical time bound that we interpolate using the least 
squares method. The lower curve shows the actual execution time for different e. 
Each point on the lower curve represents an average execution time of 10 runs. 
Figure 1 shows that as the e value decreases, the total execution time tends to 
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Fig. 1. Illustrating the dependence of the execution time on e 



increase polynomially; the overall pattern of the lower curve seems to indicate an 
inverse polynomial dependence of the execution time on e. The figure also shows 
that as e decreases, the actual execution time increases in a much slower manner 
than the predicted theoretical time bound. This suggests that as e decreases, the 
algorithm on average tends to run much faster in practice than the worse-case 
theoretical time bounds. 

4.2 Running Time vs. the Number of Commodities 
or Number of Edges 

The MCFP algorithm in [15] shows that by grouping the commodities with the 
same source in the process of each iteration, the dependence of the total running 
time on the number of commodities can be virtually eliminated. 

We tested this hypothesis on a set of maximum concurrent flow problems 
with the same numbers of vertices and edges, but different numbers of com- 
modities. The results show that the total execution time is affected slightly by 
the number of commodities in certain range, due to the change of the [3 value. 
When the number of commodities increases, the (3 value gets smaller, thus the 
total execution time actually decreases. When (3 drops below 1, the scaling pro- 
cedure ensures 1 < /? < 2, hence after that, the execution time becomes pretty 
stable as the number of commodities changes. 

The experimental results also show that with the same commodity number, 
the total execution time increases polynomially as the number of edges increases. 

4.3 Comparison with Leong et aZ.’s Implementation 

Based on our experience, Leong et al.’s implementation is one of the best codes 
that we know for approximately solving the MCFP. Thus, we compare our imple- 
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Table 1. Comparison of our execution Table 2. Comparison of our output qual- 
time (in seconds) with Leong et al.’s (with ity (in percentage) with Leong et al.’s 
e = 0.01) (with e = 0.01) 



Problem size (n, m, k) 


Our work 


Leong et aZ.’s 


(40,86,60) 


2.47 


7.51 


(50,100,60) 


4.22 


4.97 


(55,128,150) 


7.33 


169.60 


(100,202,120) 


17.40 


25.96 


(180,413,250) 


85.25 


114.75 


(250,587,300) 


204.36 


87.41 


PMFP 






(20,47,380) 


2.76 


344.92 


(40,222,1560) 


2.99 


5.32 


(50,305,2450) 


10.15 


15.74 


(100,661,9900) 


35.19 


102.63 


(200,706,39800) 


136.96 


205.41 


(300,1064,89700) 


500.40 


146.38 



Problem size (n, m, k) 


Our work 


Leong et aZ.’s 


(40,86,60) 


39.37 


23.11 


(50,100,60) 


17.30 


33.33 


(55,128,150) 


37.79 


21.36 


(100,202,120) 


25.60 


16.67 


(180,413,250) 


17.60 


16.95 


(250,587,300) 


19.36 


16.67 


PMFP 


% 


% 


(20,47,380) 


45.12 


8.07 


(40,222,1560) 


28.26 


6.67 


(50,305,2450) 


26.18 


7.16 


(100,661,9900) 


6.59 


3.35 


(200,706,39800) 


1.61 


0.15 


(300,1064,89700) 


0.75 


0.03 



mentation with theirs on both the general MCFP and PMFP. Some comparison 
results on the execution time are shown in Table 1, and some comparison results 
on the output quality are shown in Table 2. 

The execution times in Table 1 represent the average times of 10 runs per 
entry. We noticed that our implementation does not really have much advantage 
on the execution time over Leong et oZ.’s. In the 231 problem instances we 
tested, for 61.9% of the test cases, our algorithm performs faster than Leong 
et al.’s algorithm; for 92.61% of the test cases, our algorithm produces better 
quality results (higher percentages of the commodity demands are routed). 

A possible reason that our implementation does not have obvious advantages 
on the execution time may be that Leong et al.’s code uses very straightforward 
and efficient data structures such as arrays, and it deals with only integer lengths, 
demands, and capacities. Our code is designed for more general settings and is 
easier to expand to accommodate additional requirements; these advantages may 
pay a price of incurring more overhead in the execution of our code. Note that 
for problem instances with larger graph sizes, Leong et al.’s implementation 
appears to have a better chance to outperform our MCFP implementation in 
terms of the execution time. A reason to explain this phenomenon is that their 
implementation uses the following heuristic: If the flow result does not improve 
after 2000 consecutive iterations, then their program will think that there is not 
much chance to converge and thus stop the execution. In such situations, their 
program may run faster, but according to Table 2, our approach gives flow results 
with a much better approximation quality, especially for PMFP instances. 



4.4 Performance of Our Normalized Cut Implementation 

Wu et al. [25] have shown that by applying the fast approximation MCFP algo- 
rithm [15], an approximate minimum normalized cut can be computed efficiently. 
Our experiments verified this hypothesis. For most problems we tested, the time 
for the PMFP computation occupies more than 90% of the total execution time 
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of the algorithm [25]. The PMFP computation clearly dominates the total run- 
ning time for computing an approximate minimum normalized cut. 

We also compare the execution times and the output approximate normalized 
costs between our approach and Shi and Malik’s heuristic algorithm [24]. Our 
implementation produces a much better graph partition (smaller normalized 
cost) comparing to their eigenvalue/eigenvector based normalized cut approach. 
However, our execution time increases faster as the graph size gets larger. 

4.5 Some Limitations 

During the implementations and experiments, we did notice some limitations of 
our approaches. These limitations are the potential areas where our approaches 
could benefit from future research. 

First, as described in Section 3, to combine the normalized cut algorithm 
[25] with the MCFP algorithm [15], we convert the input undirected graph to a 
directed graph for the MCFP computation, and then convert the directed graph 
back to the undirected graph for the sphere-growing process. This conversion 
gives rise to a larger size graph, and the MCFP computation may waste time on 
some “meaningless” edges. 

Another difficulty is that the key parameter <5 set in the approximation MCFP 
algorithm [15] is usually too small to be implemented efficiently. This affects both 
the correctness and running time of the programs. Although we have used some 
strategies and heuristics to deal with this problem, we have not yet found an 
optimal way to handle it. 
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Abstract. The supply chain between manufacturers and retailers al- 
ways includes transshipments through a network of locations. A major 
challenge in making demand meet supply has been to coordinate trans- 
shipment activities across the chain aimed at reducing costs and increas- 
ing service levels in the face of a range of factors, including demand 
fluctuations, short lead times, warehouse limitations and transportation 
and inventory costs. The success in implementing push-pull strategies, 
when firms change from one strategy to another in managing the chain 
and where time lines are crucial, is dependent on adaptive transshipment 
scheduling. Yet again, in transshipment through crossdocks, where just- 
in-time objectives prevail, precise scheduling between suppliers, cross- 
docks and customers is required to avoid inventory backups or delays. 
This paper studies transshipment with supplier and customer time win- 
dows where flow is constrained by transportation schedules and ware- 
house capacities with the objective to minimize costs, including inven- 
tory costs. Transportation is provided through flexible schedules and 
lot-sizing is dealt with in the models through multiple shipments. We 
propose polynomial time algorithms or show the complexity of problems 
studied. 



1 Introduction 

Making supply meet demand (Fisher et al. 1994) has been a challenge for many 
industries. In the face of increasing numbers of products, decreased product life 
cycles, inaccuracy of demand forecasts and short lead times, strategies to achieve 
this are a high priority for many firms. This paper studies one such a strategy, 
that of transshipments. 

Transshipment is concerned with how much to ship between which locations 
on which routes and at what times. In classical models, where transshipment 
is studied in the context of network flow (e.g. Aronson 1989, Hoppe and Tar- 
dos 1995), there has been no work which includes inventory and time window 
constraints present in business models. One such model where transshipment 
becomes an important factor is in crossdocking which has become synonymous 
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with rapid consolidation and processing. Napolitano (2000) has described various 
crossdocking operations which include manufacturing, transportation, distribu- 
tor, transportation, retail and opportunistic crossdocking, all of which have the 
common feature of consolidation and short cycle times made possible by known 
pickup and delivery times. Crossdocking networks are, in fact, difficult to man- 
age and require significant start-up investment where crossdocking strategies are 
effective only for large systems (Simchi-Levi 2003). Here, we have sufficient vol- 
umes of goods which are picked up and delivered at crossdocks by vehicles with 
the objective of meeting demand from supply while incurring minimum or no in- 
ventory holding costs. In view of the inverse relationship between inventory and 
transportation costs, where larger lot sizes can reduce delivery frequency and 
transportation costs but increase inventory costs, transshipment from suppliers 
to customers through crossdocks can be managed to exploit this relationship and 
reduce costs only if shippers are able to operate within transportation schedules 
and time window constraints at all points of the supply and demand chain. 

In this work, we study general transshipment networks where vehicles are 
available at fixed times or always available depending on transportation ven- 
dors, and where both shipping and delivery can be only be effected within time 
windows at supply and delivery locations. To cater to manufacturer or warehouse 
constraints, supplier time windows are included. This will allow, for example, the 
shipper flexibility in planning for best shipout times in line with his production 
and operating schedules. Flexible supplier shipout times can benefit attempts 
at integrating the ’’back ends” of supply chains (Simchi-Levi et al 2003) bet- 
ter into the chain. We include inventory holding costs at transshipment points 
or crossdocks as these are burdens that can arise in any crossdocking strategy. 
Techniques using network flow are developed which are easily applied, yet general 
enough to be useful in a range of problems. Where polynomial time algorithms 
are not possible, we provide complexity information for the problem. 

This paper is organized as follows. In Section 2, we give a detailed formulation 
of the basic model as an LP and in Section 3 we derive solutions to the models 
and discuss time complexity of the models. In Section 4, we show that techniques 
can be used when penalties are included. Finally, we provide a short summary 
of this work in Section 5. 



2 The Basic Model 

We represent the basic problem by a bipartite network constructed as follows: 
Let S = {l,...,n} be a set of supply nodes where, for each i G S, Si units of 
cargo is available and can be shipped (released) in the time window [bl,e'l], D 
= m} be a set of demand nodes each requiring dk units of cargo which can 

be delivered (accepted) in the time window [6^, e^], for k G D, and T = {1, ..., /} 
be transshipment centers (TCs), each with inventory capacity Cj and holding 
cost hj per unit per time, for j G T. We take to consist of all scheduled 
routes between points in S and T, i.e., available routes serviced by transport 
providers, where each route has scheduled departure (begin) and arrival (end) 
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times, unit transportation costs, and is represented in the network by a directed 
arc from a supply node to a TC. Similarly, we take S 2 to be the set of routes 
between T and D and construct directed arcs from TCs to demand nodes. Here, 
we assume w.l.o.g. that total supply and demand are equal since the imbalanced 
problem can be easily transformed using dummy-nodes. We can now introduce 
the following basic notation: 

S: set of supply nodes 
T: set of TC nodes 
D: set of demand nodes 
Si', total supply at i 

[61, el]: shipping time window at supply node i 

Vij-. set of arcs from supply node i to TC j in S\, = {(tb,te) : (tb,te) € 

51 }, where tb and te are the begin time and the end time, respectively, of an arc 
from i to j in S'! represented by {tb,te) 

Fj-. set of time points cargo arrives at TC j, i.e., Fj = {te ■ (tb,te) € Vij, 
i = l,2,...,n} 

Cij,tbte- unit shipping cost from supply node i to TC j on arc {tb,te) 
kijfyte'- shipping capacity from supply node i to TC j on arc {tb,te) 
di'. total demand at i 

[6“, e“j: delivery time window at demand node i 

Vjj.: set of arcs from TC j to demand point k in S 2 , = {{ib,te) '■ 

{tb,te) G S 2 }, where tb and te are the begin time and the end time, respectively, 
of an arc from TC j to fc in S' 2 - 

Fj-. set of time points when cargo leaves TC j, i.e. Fj = {tb : (tb,te) G Vjf., 

k = 1,2 , ..., m} 

c^j unit shipping cost from TC i to demand j on arc {tb, te) 

shipping capacity from TC i to demand point j on arc {tb,te) 
hi', unit inventory holding cost per time in TC z 
Ci'. inventory capacity of TC i 

- uo. of units shipped from supply i to TC j through {tb,te) G Si 
Xjk tbte'- units shipped from TC j to demand k through {tb,te) G S 2 

yjt'. inventory of TC j at time t, where t € Lj = Fj [J Fj 

Here, we can sort t G Lj in increasing order, i.e., t\ <t 2 < ... < U < U+i < 
... < t|Lj|) uud take Ati = tt+i —ti,i = 1,2, ...,|Lj| — 1. Our purpose is to find 
the flow in the network which minimizes transportation and inventory holding 
costs in the objective: 

^ ^ ^ ^ Cjk,tbteXjk,tbte AthjPjt 

ies jeT (tb,te)eVij jer keo jerteLj 

( 1 ) 

subject to the following supplier capacity (2), customer demand (3), inventory 
(4,5), transport capacity (6,7), inventory capacity (8) and non-negativity con- 
straints: 
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Xij,tbtej and yjt are non-negative integers for all i G S, j G T, k G D, 

(tb,te) G Vij,(tb,te) G Vij,t G Lj 

The inventory constraints (4,5) ensure that initial and final inventory levels 
are zero, and the inventory level at time tp is the sum of the inventory level at 
time tp-i and the total inbound at time tp minus the total outbound at time tp. 
In the capacity constraint (6) requires that at each TC, the inventory level at 
any time should be no greater than the capacity of the TC. 

3 Fixed and Flexible Schedules 

If the departure and arrival times in the transport schedule are fixed, we call 
the problem a fixed schedule problem. A schedule is a fixed schedule once begin 
times are fixed. It is common, however, that suppliers require the flexibility to 
ship at any time within operating times. This is possible only if shippers have 
flexible schedules. In this case, the problem is a, flexible schedule problem. In this 
paper we focus on the flexible schedule problem and we will mention that those 
cases for flexible schedules have the similar results as those for fixed one. Figure 
1 provides and illustration of this type of problem. 

3.1 Multiple Shipping and Multiple Delivery for Flexible Schedules 

We first consider the problem when both transportation schedules (i.e. Si and 
52 ) allow each source to ship cargo several times within its time window (mul- 
tiple shipping) and each demand point can accept cargo several times within its 
time window (multiple delivery). Here we take arcs (i, j) between S and T to be 
the scheduled routes: {b[, + Uj), (b^ + I,b1 + l + Uj), ... ,(e'’, -I- fj), where 

tij is the transportation time from source i to TC j and we have discretized 
time horizon into unit intervals. Arcs between T and D are created similarly. 
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T.W. Source Crossdock Demand T.W. 



(cost, capacity, time ) (cost, capacity, time ) 




cost per unit per time) 



Fig. 1. A flexible schedule problem 

When we use Time-expand method to create a duplicate of TC j for each fea- 
sible time point (for supply nodes this is a feasible arrival time and for demand 
nodes this is a feasible deliver time), we should consider transportation time. 
For example, take two supply nodes si and S 2 , two demand nodes c?i and d 2 , 
and a TC node c. Assume si has time window [7,14] with a transportation time 
to c equal to 2; for S 2 , di and c? 2 , take these to be [9,12] and 1, [11,13] and 1 
and [10,12] and 2, respectively. Possible arrival time points for si and S 2 are 
then {9,10,11,12,13,14,15,16} and {10,11,12,13}, respectively, and possible de- 
liver time points for di and c ?2 are {10,11,12} and {8,9,10}, respectively. Hence 
the feasible time points for the TC c are {9,10,11,12}. This fixes the number of 
duplicates of c to be 4. Note that a feasible time point can be an arrival time, 
a deliver time or both. We can now give a transformation which can transform 
this case into a minimum cost flow problem (MCFP) in the following 

Transformation A 

Step 1: Taking into consideration time windows and transportation times, find 
all feasible time points for each TC j, and denote Nj be the number of the 
feasible time points of TC j. 

Step 2: Using these feasible time points and original network, construct a new 
network consisting of five partitions using the following rules: 

1. First, generate the third partition. For each TC j, generate Nj duplicates 
by sorting the time points in an increasing order, connect the duplicates to 
form a path beginning from the earliest time point to the latest time point 
one by one with the arcs of cost Ati * hj, capacity cj and directed from 
earlier nodes to later ones, where Ati is the length of time interval between 
the corresponding duplicates. 
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2. In the second partition, generate a set of feasible supply nodes for each TC j 
and connect these to j with arcs of zero cost and infinite capacity according 
the feasibility of the time constraint. Connect those supply nodes which 
represent the same supply node i (i = 1, 2, n) in the second partition to a 
new generated node which belongs to the first partition and represents the 
corresponding supply node i with arcs of cost cy and capacity kij. 

3. The fourth and last partitions are both made up of demand nodes. And 
the generation method is similar to the generation of the first and second 
partitions. 

4. Set all the supply nodes i in the first partition and all demand nodes j in the 
fifth partition to have supply and demand dj respectively and all other 
nodes to have zero demand. The direction of each arc between two partitions 
will be from nodes in the lower-indexed to the higher-indexed partition. 

The above method has a pseudo-polynomial time complexity because the 
duplicate number of each TC depends on the total number of feasible time 
points. Polynomial time complexity, follows easily from the following Lemmas: 
Here, inventory refers to cargo which is not shipped out immediately when they 
reach TCs, i.e., cargo which incurs holding costs at TCs. 

Lemma 1 An optimal solution for the MCFP resulting from Transformation A 
satisfies the following conditions: (1) Inventory can only have been shipped from 
source i to TC j at the end time point e[ of its time window; (2) The demand 
point k can only receive inventory at the begin time point 6^ of its time window, 
(3) If there are flows from supplier i to demand point k through TC j, then there 
is only one flow from source i to demand k through one duplicate corresponding 
to one feasible time point Ujk of TC j, where Ujk G [bij,eij] f][bjj,,, e^-^] yf <f> and 

[6y,ey] = -k tij,el + Ufl and [6'-fc,e'-fe] = [bl - t^^,el - t'-^] and Uj and 
are the transportation times from source i to TC j and from TC j to demand k 
respectively. 

Proof: For (1), assume that in an optimal solution, inventory is shipped from 
source i to TC j at time points t\, ■■■, tp, where t^ G e[ — 1], g = 1, 2, ...,p. 

Then, this inventory cannot be sent out from TC j between [bij, Cij — 1]. Other- 
wise, we can postpone the shipping time of the inventory to the last time it can 
be sent out to TCs, where then feasibility still holds, shipping costs remain the 
same, but inventory cost will be reduced so that the objective value is reduced, 
which is impossible because of optimality. We find that all this inventory will 
flow to the last duplicate corresponding to the time point e\. Further, we can 
also delay shipping this inventory from source i at time el, for which feasibility 
holds, shipping costs remain the same but inventory cost will not increase. 

For (2), move all inventory received by demand point k during to the 

first flow corresponding to time point b^, so that feasibility continues to hold, 
shipping costs remain the same but inventory cost does not increase. 

For (3), if there are several flows from source i to demand k through TC j, then 
we choose one duplicate corresponding to one feasible time point tijk of TC j, 
where tijk G [bij,eij]f][bjf.,ejj^] and merge all the flows into one flow through 
the duplication tijk- It is then easy to see that the feasibility holds, inventory 
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is not affected, and shipping cost remains the same, so that the objective value 
remains the same. □ 

Lemma 2 The multiple shipping and multiple delivery case can he transformed 
to a MCFP for which each TC has m + n duplications. 

Proof: By Lemma 1, for each TC j, we only need to consider those duplicates 
corresponding to time points e[ and 6^ for all z = 1, 2, ..., n; j = 1, 2, ..., m. For 
[6y , ey] e'.fc] (t>, either e\ + ty or 6“ - belongs to [hj.eij] n[6'-fe, e'-^]. 

The total duplications are then easily seen to be m+n. The method of construct- 
ing duplications is different from Transformation A, but the other procedures 
are in Transformation A. □ 



3.2 Single Shipping and Single Delivery for Flexible Schedules 

Here we consider the case when there can be only a single shipment out of each 
supply point within its time window, and, similarly, where each customer can 
accept cargo only once within its time window. In such case, there can be several 
arcs for each supply point, of which only one can be used. Similarly, there can 
be several paths in S 2 leading to a demand point, but where only one will be 
used. A single shipping strategy can be preferred by a shipper as it requires 
less setup costs and fixed costs related to shipping each consignment through 
one or more shipping agents. The following theorems show that this problem 
is NP-complete in the strong sense, even if time windows and capacities are 
relaxed. These hardness results imply no polynomial algorithm exists for the 
single shipping problem. 

THEOREM 1: The flexible schedule single shipping and single delivery prob- 
lem is ~NP-complete in the strong sense, even if supply node time windows and 
transportation capacities are relaxed. The same hardness is valid for the problem 
of finding a feasible solution 

Proof: We give a reduction from the 3-PARTITION problem. Here, our prob- 
lem is: Does there exist a feasible schedule that achieves the objective value no 
greater than K1 From an arbitrary instance of 3-PARTITION, consider the 
following polynomial reduction to an instance of our problem. We have iw sup- 
pliers of S, and w customers D = {1, 2, ..., zu}. For each i G S, let s(z) be its 
provision, while for each j € D, let the demand be B with time window [j,j]. Ex- 
actly one TC c with capacity B and unit holding cost for per unit cargo per unit 
time exists to link suppliers and customers, and both time delay and shipping 
cost are zero for all routes. Let K be zero. We now show that a feasible schedule 
of objective value equal to zero exists if and only if the 3-PARTITION has 
a feasible solution. On one hand, if 3-PARTITION has a feasible solution of 
S\, ..., Syj, then we ship all cargo provided by suppliers in Si to c at time j, 
which satisfies the demands B for each j within its time window [j,j], where 
j = 1, 2, ..., w. It is easy to verify that such a schedule is feasible and its objective 
value is zero. On the other hand, if a feasible schedule exists whose correspond- 
ing objective value is zero, then we can construct a partition by setting Sj to 
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be the subset of z G S' with positive flow to c at time j, for I < j < w. Because 
the objective value is zero and the demands are B at time j, we can assume 
that there is no inventory before time j. We then have *(*) — where 

if then ~ ^ units of inventory will remain in the 

TC incurring (non-zero) cost. This contradicts the zero objective value. Since 
i?/4 < s(z) < i?/2 for z G S, we have |Sj| =3 and because of the single shipping 
constraint, we know that Si, ..., S^ is a feasible partition for the instance of 
3-PARTITION and this completes the proof. □ 

3.3 The Remaining Cases for Flexible Schedules 

The cases, for single shipping multiple delivery and multiple shipping single 
delivery are both NP-complete in the strong sense, even if supply point time 
windows and transportation capacities are relaxed. The same hardness is valid for 
the problems for finding a feasible solution. As the proof is similar to those given 
in Section 3.2, we omit these here. Another important result is that for the case 
multiple shipping and multiple delivery without inventory capacity in TCs, we 
can enhance our model to deal with continuous time, and similar transformation 
method can be used to solve it in polynomial time complexity. 

3.4 Cases for Fixed Schedules and Mixed Schedules 

When the schedules are fixed, we still have the same result for each case as that 
for flexible schedules. We also study the mixed schedules that are a combination 
of a fixed schedule and a flexible one. These conditions can occur when, say, a 
fixed schedule is the only one available to shippers or the schedule preferred by 
shipper. By working on such kind of problems, we also And that each case for 
mixed schedules still has the same result as that for flexible one. Because the 
limitation of the paper length and the ideas and methods for fixed schedules and 
mixed one are similar to the flexible one, we omit the details. 

4 Other Applications 

The transformation techniques used for transshipment problems developed here 
can be used for other problems. Here we only describe how to apply them to 
the extended problem with penalties on suppliers. It is common in practice to 
penalize late shipments which can incur significant costs to the entire supply 
chain. In the following, we show the techniques developed are easily adapted to 
penalties imposed on shipments made outside of supplier time windows. To do 
this, we add penalties directly to the shipping costs in the which we represent 
as piecewise functions where shipping costs Cij^t^te for z G S' and j G T can be 
defined as follows: 

T if tb < 

if4G[6r,e[] 

Wij^tite + if fb > Ci 
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where Wij^me are original unit shipping costs, and aij^t^te and are positive 

and represent unit earliness and tardiness penalties respectively. Under these 
assumptions, we can apply the techniques already developed to the following 
cases. 

4.1 Flexible Schedule Problems with Penalties 

Here we only discuss the cases for flexible schedule, and those cases for fixed 
schedules and mixed one have the same results by using the similar ideas and 
methods. In the multiple shipping and multiple delivery case, each path from 
supply to demand through a TC is now feasible. Here, paths can be of three 
types: early, on-time and tardy, where early paths include paths which can only 
begin earlier than the time window, on-time paths are those which begin within 
the time window, and tardy paths are those paths which begin later than the 
time window. As mentioned before, applying Lemma 2, we have improved the 
Transformation A to give a polynomial solution. Now we modify this improved 
Transformation A to solve the penalty case. In this case, if bij > €jf,{bij and 
Cji. are defined as in Section 3.1), we can choose as a extra feasible time 
point belonging to a early path, and the total number of such early feasible time 
points is no more than m. We then find that, similar to the previous analysis, the 
number of duplicates of each TC is no more than 2m + n, where the additional m 
duplicates arise from early feasible time points. We connect supply nodes in the 
second partition to the duplicates of each TC in the third partition according to 
the following rules: for the on-time path, i.e.[6y, Cy] connect 

supply i to the duplicate representing time point or 6^-^; for a tardy path, i.e. 
Cij < bji^, connect supply i to the duplicates representing time point e^- and bjf, 
respectively; and for early path, i.e. bij > e^-^, connect supply i to the duplicate 
representing time point e^^,. And those added arcs have infinite capacity and 
costs corresponding to their path types. After above modification, we can use 
the modified transformation method to solve this case. 



4.2 The Remaining Cases for All Problems 

The remaining cases for all problems in section 3 are strongly NP-complete prob- 
lems and the results of their corresponding supplier penalties problems continue 
to hold with similar proofs. 

5 Summary 

We have studied transshipment through crossdocks which includes inventory and 
time window considerations. The objectives for this study has been twofold: (1) 
to find cost optimal schedules in transshipment networks with time-dependent 
supply and demand which satisfy transportation schedule constraints, and (2) 
to meet the primary objective of minimizing inventory at crossdocks within the 
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context of (1). The techniques used are easily implemented, where for polyno- 
mially solvable cases, we provided transformation techniques which reduce the 
problem to a flow problem. Other cases were shown to be NP-complete in strong 
sense. 
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Abstract. Chen and Kanj considered the Vertex Cover problem for 
graphs with perfect matchings (VC-PM). They showed that: (i) There is 
a reduction from general Vertex Cover to VC-PM, which guarantees 
that if one can achieve an approximation factor of less than two for 
VC-PM, then one can do so for general Vertex Cover as well, (ii) 
There is an algorithm for VC-PM whose approximation factor is given 
as 1.069 + 0.069d where d is the average degree of the given graph. In 
this paper we improve (ii). Namely we give a new VC-PM algorithm 
which greatly outperforms the above one and its approximation factor 
is roughly 2 — • Our algorithm also works for graphs with “large” 

matchings although its approximation factor is degenerated. 



1 Introduction 

Everybody agrees that Vertex Cover is one of the most important combinato- 
rial-optimization problems, which has a variety of both practical and theoretical 
applications. Another important aspect of this problem is its approximability: 
There has been a well-known, straightforward algorithm which achieves an ap- 
proximation factor of two, but in spite of much effort, no one has succeeded in 
providing an upperbound of 2 — £ or a lowerbound of two. (The current best 
lowerbound is 1.3606 [6]). Under the assumption that achieving a general up- 
perbound of 2 — £ is quite hard, our common approach is to investigate several 
restricted cases which give us some insight on the general goal and/or which are 
important in practice. In fact there is a large literature along this line (e.g., [11], 
[3], [10], [1], [12], [4]). _ ^ ^ 

In [4], Chen an Kanj considered this problem for graphs with perfect match- 
ings, which they called VC-PM. They gave an interesting reduction from the 
general Vertex Cover to VC-PM which somewhat preserves the approxima- 
tion factor. Using this reduction, they proved that if one can achieve an approx- 
imation factor of less than two for VC-PM, then one can do so for the general 
problem as well. Thus, for the purpose of breaking the barrier of two, it is enough 
to attack VC-PM. Unfortunately, however, the specific VC-PM algorithm they 
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presented in [4] does not seem to have much gain in this sense. Their approxi- 
mation factor, given as 1.069 -I- 0.069d, becomes two when d = 13.5, where d is 
the average degree of the given graph. 

Our contribution. We give a new algorithm for VC-PM, whose approximation 
factor is illustrated in Fig.l. One can see that it is much better than that of 
Chen and Kanj’s and also outperforms the well-known algorithm by Halldorsson 
and Radhakrishnan when it is run for graphs with perfect matchings (the latter 
algorithm is actually for obtaining an independent set, but can also be used for 
the current problem) . Our algorithm also works for graphs with large matchings 
(whose size is smaller than n/2) although the approximation factor is degener- 
ated. 




Fig. 1. Comparison to the previous re- 
sults 




Fig. 2. Approximation factor of VCPM- 
Apx(w) for each w 



The basic idea of the new algorithm is to use the Weighted Max-2Sat 
and adjust the weighting function according to the average degree of the instance 
graph. Thus our algorithm A is actually a sequence of algorithms A(l), A(2), . . . , 
A{w ), . . . , parameterized by w (the weight for matching edges). For each w, A(w) 
is better than any other A{w') in some range of d as illustrated in Fig. 2. 

Previous work. For general Vertex Cover, finding any maximal matching 
in the given graph and choosing all vertices in the matching yields a factor-2 
vertex cover. However, no approximation algorithm of factor 2 — e is known 
for any constant e > 0. The current best approximation algorithms are those 
found by Bar- Yehuda and Even [2] and Monien and Speckenmeyer [14], whose 
approximation factor is 2— i one found by Halperin [10] which achieves 

a factor of2— (1 — o(l)) ^ " . There has been no further progress on this general 
bound for about two decades. 

However, for the restricted version of the problems, many good algorithms 
are invented. For graphs whose maximum degree is bounded by Z\, Hochbaum 
[11] presented 2 — ^ factor using a coloring method. Exploiting Halldorsson and 
Radhakrishnan’s [9] approximation algorithm for the Independent Set problem, 
one can obtain factor of 2— ^ semidefinite programming, Halperin 

[10] asymptotically improved these results by presenting an approximation algo- 
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rithm of factor 2 — (1 — o(l)) . For small A, Berman and Fujito [3] achieved 

a factor of 2 — For planar graphs this problem admits PTAS [1]. For e-dense 
graphs, that is, for those graphs where each vertex has at least e\V\ neighbors, 
Karpinski and Zelikovsky [12] showed that there is an approximation algorithm 
of factor 2/(1 + e). For graphs whose average degree can be represented as e\V\, 
Karpinski and Zelikovsky [12] also showed that it is approximable within a factor 
of 2/(2- \/r^). 

For VC-PM, other than the algorithm in [4] whose approximation factor is 
1.069 + 0.069d, only the algorithm by Halldorsson and Radhakrishnan [9] seems 
competitive in terms of the approximation factor parameterized by the average 
degree. Its approximation factor is given as 2 — 5/ {2d + 3). 

For the negative direction, Hastad [10] showed that it is NP-hard to approx- 
imate Vertex Cover within a factor of less than 7/6 = 1.1666. Dinur and 
Safra [6] improved this lowerbound to 10-\/5 — 21 = 1.3606. Even for graphs 
whose maximum degree is bounded by Z\ > 3, this problem is APX-complete 
and for large A it is NP-hard to approximate within 1.3606 [5]. As a most recent 
result, Khot and Regev [13] showed that, if a certain conjecture about the PCP 
theory is true, we can prove the nonexistence of 2 — e factor approximation al- 
gorithm for Vertex Cover. For VC-PM, using the reduction introduced by Chen 
and Kanj [4], we can easily obtain a lowerbound of 5-\/5 — 10 = 1.180. 

2 Notations and Definitions 

We employ standard terminologies in graph theory. Let G = {V,E) be a graph. 
For a subset W CV, the induced graph G{W) = {W, E{W)) is the subgraph of 
G induced by W, i.e. E{W) = {{u,v) : u,v €W and {u,v) G E{G)}. We denote 
by W the complement of IF C V, that is V — W. For every vertex v G V, Na{v) 
is the set of vertices adjacent to v. The degree dciv) of w in G is the size of 
Ng{v), namely dciv) = |V( 3 (ti)|. The average degree c? of a graph G is defined 
asd(G) = 2|A|/|V|. 

Vertex Cover and Independent Set are fundamental NP-complete prob- 
lems [8]. A set G C V of vertices is a (vertex) cover of a graph G if every edge 
of G has at least one endpoint in G. Let Opt{G) be the size of a smallest ver- 
tex cover of G. Vertex Cover is a computational problem that asks for an 
optimal-size vertex cover of a given graph. Although Vertex Cover is NP- 
hard, polynomial-time algorithms are known that output a vertex cover G of 
any given graph G such that |G|/|Opt(G)| < r, where r > 1 is a real number 
called an approximation factor of the algorithm. A set / of vertices of a graph 
G is an independent set of G if every edge of G has at most one endpoint in 
I. An algorithm for Independent Set is said to achieve a factor r if it out- 
puts an independent set / of a given graph G such that Optjg{G)/\I\ < r, 
where Optjg(G) is the size of a largest independent set of G. If a given graph 
G satisfies Opt{G) > then one can use an r-approximation algorithm for 

Independent Set as a (2 — l/r)-approximation algorithm to obtain a vertex 
cover of G, by letting the algorithm find an independent set of G and output its 
complement as a vertex cover of G. 
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This paper studies Vertex Cover with Perfect Matching (VC-PM), 
which is a version of Vertex Cover where the instance graph is promised to 
have a perfect matching [4]. 

We use an approximation algorithm for Weighted MAX-2SAT. Fix a 
set X of Boolean variables. An assignment cr to A is a mapping from X to 
{TRUE, FALSE}. A CNF-formula F is a Boolean formula consisting of a con- 
junction of disjunctions of literals in {x^x : x S A}, where each disjunction is 
called a clause of F. A CNF-formula is weighted if each clause is assigned a real 
number, called the weight of the clause. The weight |ct|f of an assignment ct to A 
with respect to F is the sum of weights of the clauses satisfied by cr. Let Opt{F) 
be the weight of an heaviest assignment with respect to F. An r-approximation 
algorithm of Weighted MAX-2SAT computes an assignment ct of a given for- 
mula F such that Opt{F)/\a\F < r. An assignment cr is said maximal if for 
each variable x such that <j{x) = FALSE, flipping a{x) from FALSE to TRUE 
decreases |cr|i^’. We abbreviate |cr|i^’ as |cr|, if there is no confusion. 

3 Our Algorithm 

Let G = (V,E) be an instance graph of VC-PM, so G is assumed to have a 
perfect matching. The goal of VC-PM is to obtain a vertex cover C of G such 
that \C\ = Opt{G). We denote n = \V\ and m = \E\. 

3.1 Basic Ideas 

For finding an approximate feasible solution G of G, Chen and Kanj has taken 
the following strategy (see Lemma 4.1 of [4] for details): Let 

Fg= /\ (a;„Va:„)A f\ (T„VT„). 

{u,v)GE {u,v)GM 

Thus, Eg is an unweighted CNF-formula having variables in a set Xg = {xu ■ 
u G V}, and having m -I- n/2 clauses. Regarding this formula as a MAX-2SAT 
instance, they apply a MAX-2SAT approximation algorithm to obtain some 
assignment a to Xg- Without loss of generality this assignment is assumed to 
be maximal. Then, they give the following vertex set of G: = {v G U|cr(a;„) = 

TRUE}. Let us call the m clauses (a;„ V Xy) such that (u,v) G E the positive 
clauses of Eg and the other n/2 clauses, (x„ V Xy) such that (u,v) G M, the 
negative clauses. Obviously T„ covers all edges in the graph G if and only if 
the underlying assignment cr to Xg satisfies all positive clauses of the formula 
Eg- In addition, every maximal assignment satisfies all positive clauses. Thus, 

is a vertex cover of G and their algorithm outputs it. On the other hand, 
the negative clauses of Fg play the role of bounding the covering size \T„\. As a 
whole, if we find an assignment which satisfies as many clauses as possible, then 
it turns out to be a “good” assignment for our objective. 

Here comes the major difference between our new algorithm and the Chen 
and Kanj’s: We also use the same CNF-formula Fg, but as an weighted MAX- 
2SAT instance. Each positive clause has weight of one as before but each negative 
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clause has weight of w which may be greater than one. Intuitively adopting a 
larger w may derive a smaller |7^|, namely, we can make the role of negative 
clauses more powerful. But, unfortunately, this setting may cause a drawback 
that a maximal assignment a may not satisfy some positive clauses of Fq- In 
order to remedy this drawback our algorithm finds another vertex cover Cjp 
of the subgraph induced by the vertices in Fa = {v G V\a{xv) = FALSE}. 
Obviously the union 7^ U Cjp covers the original graph G and thus the algorithm 
outputs it as a vertex cover of G. 

3.2 New Algorithm 

Figure 3 presents a formal description of our algorithm, which we call as VCPM- 
Apx(w), VCPM-Apx(l) coincides the algorithm by Chen and Kanj, because 
G{Fa) contains no edge (by Lemma 1) and so the empty set can be given as a 
covering of G{Fa) in step 6. 



1. Input a graph G — {V, E) with a perfect matching M. 

2. Construct a formula Eg- 

3. Solve Eg and let a be the resulting approximate solution. 

4. Refine a to meet maximality. 

5. Set % = {ve V\a{xv) = TRUE} and Fa = {v G V\a{xa) = FALSE} 

6. Find a vertex cover Cjf for G{Fa)- 

7. Output Ta U Gt- 



Fig. 3. VCPM-Apx(w) 



In order to complete this algorithm we need to specify a weighted MAX- 
2SAT algorithm for step 2 and a Vertex Cover algorithm for step 5. For the 
former one we adopt one of Feige and Goemans [7], cerebrated to achieve the best 
known approximation factor r = 1.0741 • • •. In the next section we show that, 
if an assignment cr is maximal then the maximum degree of G{Fa) is bounded 
by w — 1. For the latter one, we therefore adopt one of approximation algo- 
rithms that perform better for bounded degree graphs. In this paper we adopt 
the greedy Independent Set algorithm. Beginning from G{Fa), the greedy al- 
gorithm picks a minimum-degree vertex and removes the adjacent vertices from 
the current graph. The greedy algorithm finally reaches to an independent set / 
and outputs its complement Fa — I as a, covering Gy^^ of G{Fa)- If the maximum 
degree of G{Fa) is bounded by w — 1, then we have \Gj^^ \ < (1 — l/w)\Fa\- We 
use this upperbound to derive an upper bound of the approximation factor of 
VCPM-Apx(w). See Lemmas 5 and 6 for details. 

Remark. In Section 1 we have introduced much better approximation factors 
of Vertex Cover. These algorithms can improve the approximation factor 
\Cj^^\/Opt{G{Fa)) if they are applied to G{Fa), but we actually need better 
upper bounds of both |Cjr^ | and |Cjp^ \/ Opt{G{Fa)) to get a better upper bound 
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of the objective factor \Ta U |/Opt(G). In addition, the upperbound \Cj=^ \ < 
(1 — \/w)\Ta\ is optimal when the graph G{Ta) consists of a disjoint union of 
complete graphs of size tc — 1. 

4 Approximation Factors of Our Algorithms 

In this section we give approximation factors of our algorithm VCPM-Apx(w). 
Let G = (F, E) be an instance graph and Fq be the associated formula. Let a 
be an assignment to : n G F} obtained by an r-approximation algorithm 
applied to Eg and refined to meet maximality. Let = {t> G V\a{xy) = TRUE} 
and = {v & V\a{Xy) = FALSE}. Let Gjr be a cover of G{T) obtained by 
applying the greedy algorithm to G{T). Let G = let ac be the associated 

assignment, i.e. ac(xv) = TRUE <G> n G G. Let 

S= (|cr| - |crc|)/n. 

Lemma 1. The maximum degree of G{Ta) is hounded hy w — 1. 

Proof. Assume that there is a vertex v in whose degree in G{Ta) is greater 
than or equal to w. Since all positive clauses (a;„ V Xy) such that (u,v) G 
E(G(Ecr)) and u G are unsatisfied by a, if one switches cr(a;„) from FALSE 
to TRUE then at least w positive clauses are newly satisfied. On the other hand 
this change of cr may turn the truth value of the negative clause {xu V Tt,) with 
(u,v) G M to FALSE hence we may lose weight by w. As a whole |cr| doesn’t 
decrease by that change. This contradicts the maximality of cr. 

Lemma 2. Let G he a graph, C he any vertex cover of G and ac he the associ- 
ated assignment, then 

|G| = n+ —(to - |crc|). 
w 

Proof. Since G is a cover of G, the assignment ac satisfies all to positive clauses 
of Eg, so I (Tel — TO is equal to the sum of weights of the negative clauses satisfied 
by ac. On the other hand, either m G G or n G G is met because (u, n) G M and 
G is a vertex cover. In addition, u ^ G or v ^ C, since (x„ V Xy) is satisfied by 
ac. Thus, it follows that the number of negative clauses of Eg satisfied by ac is 
equal to the number of vertices uncovered by G, that is n — \G\. Each of these 
negative clauses has weight w, so we obtain \ac \ — rn = w(ji — |G|). The lemma 
follows by expelling |G| from it. 

Lemma 3. 

Opt{G) > n + — (to — Opt{Fc)). 
w 

Proof. Let G be a minimum vertex cover of the instance graph G and let ac be 
the associated assignment. By lemma 2, it follows that Opt{G) = \G\ = — 

\ac\)/w. Since Opt{Fc) is the largest weight with respect to the formula Eg, we 
obtain Opt{Fc) > |crc|- Therefore n + (to — \ac\)/w > n + ^{m — Opt{Fc)). 
The lemma follows. 




138 



Tomokazu Imamura, Kazuo Iwama, and Tatsuie Tsukiji 



Lemma 4. 



\C\ ^.2 

Opt{G) ~ w 



{d{G) + w). 



Proof. The approximation factor of G is bounded from above as follows: 



|C| 

Opt{G) 



n + (m — \ac\)/w 

Opt{G) 

n + (m + 6n— \a\)/w 
Opt{G) 

n + (m + 6n— Opt{FG)/r)/w 
Opt{G) 

n + ^{m - OptiFc)) ^{5n + ^Opt{Fa)) 
Opt{G) Opt{G) 



< 1 + 



^{Sn + ^(m + wn/2)) 
n/2 




w 



r — 1 
wr 



{d + w). 



The first equality follows from lemma 2, the second equality follows from the 
definition of S, the third inequality follows from a factor r of approximation to 
obtain cr, the fifth inequality follows from lemma 3, the fact that Opt{FG) < m+ 
wnl2, and the assumption that G has a perfect matching and so Opt{G) > n/2. 



Now we have obtained an approximation factor of C . But, in the right hand 
side of this inequality, we have 5 whose value is unknown. So this estimation 
cannot be used to obtain an approximation factor of our algorithm, because 
when 5 is large this upper bound exceeds 2 which is the trivial upper bound for 
general Vertex Cover. Instead, in the following we show another estimation 
of G that can be used when 6 is large. 

Lemma 5. 

kl - kcl “ w{w - 1) 

Proof. We apply the greedy algorithm to G{Fa). Let / = \T„ — Cj:\. Let 
vi,...,Vf be the independent vertices picked by the greedy algorithm in this 
order. Let Co = 0, and Gi = Gi-i U {Nc{vi) n F^), for each 1 < i < f. Thus 
Cf = CjP. Let Eo = E{G{Fa)), and Ei be the set of edges in G{Fa) that are 
adjacent to none of the vertices in Ci. Let Hq = G{Fa) and Eli = (Ea,Ei), 
for each 1 < z < /. Let ai be the assignment to {xv : v G V{G)} such that 
o'i{xy) = TRUE u G 7/ U Gi. Thus cr/ = ac. 

Then, Ci = Ci-i U (vi) for each 1 < z < /. Particularly, {ui, . . . , Vf} n 

C^ = 0, hence \E„\ - |C^| > /. 

Let A = w—1. Lemma 1 shows that dG(j^„) < A. We claim that — |(Ti| < 
A{A + 1) /2 for each 1 < z < /. 
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Case 1: Hi-\{N Hi_i{vi) U {t^i}) is the Z\-complete graph. The assignment (Ji is 
obtained from <Ti_i by flipping the truthvalue cri_i(a;„) for each v € NHi_i{vi) 
from FALSE to TRUE. Thus, for each edge (u,v) of U {wi}), 

the positive clause (a;„ V a;„) is unsatisfied by (Ti_i but satisfied by ai. Since 
Hi-i{NHi_i (ui)U{ui}) is the Z\-complete graph, it contains A{A+l)/2 edges. So, 
the contribution from the positive clauses of Fq to |(Ti_i | — |(Ji| is at most —A{A+ 
l)/2. On the other hand, there are at most = A edges (m, u) G M 

such that {u, v}nNHi_i{vi) yf 0, and the corresponding negative clauses (x„Vx„) 
may be satisfied by ai-i but unsatisfied by ai. So, the contribution from the 
negative clauses of Fq to |cri_i| — \ai\ is at most Aw = A(A + 1). As a whole, 
we have |cTi_i| — \ai\ < A{A + 1) — A{A + l)/2 = A{A + l)/2. 

Case 2: U {fi}) is not the A-complete graph. Let i = dHi_^{vi). 

Since Vi is picked by the greedy algorithm from V (Hi-i), (u) > i for every 

vertex u G NHi_i{vi). In addition, (ui) U {ui}) is not the A-complete 

graph, so £ < A. There are at least £{£+ l)/2 edges of iLi_i that are adjacent to 
some vertex in (vi), whose associated positive clauses of Fq are unsatisfied 

by (Ti_i and satisfied by ai. So, the contribution from the positive clauses of Fq 
to |CTi_i| — \ai\ is at most —£{£ + l)/2. On the other hand, as in Case 1, there 
are at most £ = \NHi_i{vi)\ matching edges whose associated negative clauses 
may be satisfied by ai-i and unsatisfied by ai. So, the contribution from the 
negative clauses of Fq to \ai-i \ — |cTi| is at most £w = £(A + 1). As a whole, we 
have \ai-i \ — |cTi| < £{A + 1) — £{£ + l)/2 = £{2A — £ + l)/2, which is smaller 
than A{A + 1) /2. 

Now, we have \ai-i\ — \ai\ < A(A + l)/2 for every 1 < t < /, so |(t| — \ac\ = 
k*-i| - + l)/2- On the other hand, \F„\ - \Cj^\ > /, thus 

\^.\-\C^\ > 2 ^ 2 

kl ~ kc| ~ A{A + 1) w{w — 1) 



Lemma 6. 



|g| 

Opt{G) 



< 2 - 



46 

w{w — 1) 



Proof. Since C = Tj,. U Cyr, 



\C\ < n 



{\F,\-\CA) = n- 



2Sn 

w{w — 1) 



where the last equality follows from Lemma 5. Since the instance graph G has 
a perfect matching, Opt{G) > n/2, so = 2 - 

The two bounds in lemmas 4 and 6 are oppositely related as functions of 6. 
Their combination thus provides the following approximation factor of VCPM- 
Apx(w). 
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Theorem 1. VCPM-Apx(w) achieves an approximation factor 

o ^ {l-r)d + w 
rw{w + 1) 



for graphs of average degree d. 



Proof. We choose the better one of the two approximation factors given by 
lemmas 4 and 6, respectively. That is, we set 



a = 



w{w — 1) 



r — 1 — 

1 {d + w) 



2{w + l) 

and use lemma 4 if <5 < a, else we use Lemma 6 and obtain 



\C\ ^ 2 _ n (^-r)d + w 

Opt{G) ~ rw{w + 1) 

This theorem gives a series of approximation factors of VCPM-Apx(w) for 
w = 2,3,4,... (see Figure 2), so we take their lower envelope. Particularly, 
since r > 1, choosing appropriate w according to d gives us a factor of less 
than two for all d. In more precise, for d such that (w — l)/2(r — 1) < d < 
w/2{r — 1), VCPM-Apx(w) works better than VCPM-Apx(w') for any w' yf w. 
So we combine VCPM-Apx(w) accordingly, adopting VCPM-Apx(w) if G meets 
{w — l)/2(r — 1) < d{G) < w/2{r — 1). It achieves an approximation factor 

2 - ^ ^2(r- I)d+ I.O- \J{2{r-l)d+ I.O)^ - I 




which is roughly 2 — by using r = 1.0741 in [7]. 

Figure 1 compares approximation factors of the lower envelope of VCPM- 
Apx(w), the Chen and Kanj’s algorithm, and the Halldorsson and Radhakrish- 
nan’s. VCPM-Apx(w) outperforms theirs for all d. We note that the Halldorsson 
and Radhakrishnan’s algorithm has an approximation factor of 2 — for any 

graph such that Opt{G) > (not only the graphs with perfect matchings). 

Remark. In [4], an interesting reduction is shown. It reduces an r-approximation 
algorithm for VC-PM to a max{1.5, (2r + 2)/3}-approximation algorithm for 
general Vertex Cover. Unfortunately, however, the reduction of neither their 
algorithm nor ours derives improvement on approximation factor of the general 
Vertex Cover, since their reduction does not preserve the average degree. 

A graph G = (V,E) is called everywhere fc-sparse if W' C V \E{G{V'))\ < 
fc|U'|. Note that everywhere /c-sparse graphs have average degree k/2. It is also 
shown in [4] that for everywhere fc-sparse graph the average degree is preserved 
by the reduction and our algorithm can find an approximate solution of factor 
max{1.5,2- 2 ^^}. 
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5 For Graphs with Large Matchings 



In this section we analyze VCPM-Apx(w) when they are applied to general 
graphs. We show that it performs well for graphs that have “large” matchings, 
although its approximation factor is degenerated. 

Let G = {V, E) be an instance graph, where n = \V\ and m = \E\. Let M be 
one of the maximum matchings of G which is not necessarily perfect, and let pn 
be the number of vertices in M, where 0 < p < 1 is the ratio of the size of M 
over the size of perfect matching. Given such a matching M, we construct the 
formula Eq as in section 3. 

In the following we show lemmas 7, 8, 9 and theorem 2, extending lemmas 2, 
3, 4 and theorem 1, respectively. We show the proof of only Lemma 8 and omit 
the others since they are slight modifications of the original ones. 

Lemma 7. \G\ <n + ^{m — |crc|) 

Lemma 8. Opt{G) >pn+ — Opt{EG)) 

Proof. We separa te V into the fo llowing four sets: Co = V (M) (IC, Iq = V (M) n 
C, Cl = V{M) n C and Ii = V{M) n C. It is clear that Opt{G) = \C\ = 
\Co\ + \Ci\ = n-\Io\-\h\. 

Since C is a vertex cover of G, ac satisfies all positive clauses of Eg- On 
the other hand, if a negative clause V Xy) is satisfied by crc then one of the 
variables in {xu,Xy} is assigned to TRUE and the other to FALSE. Therefore, 
the number of negative clauses satisfied by aG is equal to |/o|. It follows that 
|(Tc| = m+w|/o|. In addition Opt{EG) > \(tg\, so we obtain Opt{EG) > m+w|/o|. 
Then, the following inequality holds. 



Opt{EG) > m + w|/o| 

= m + w{n — Opt{G) — |/i|) 
> m + w{pn — Opt{G)) 



The lemma follows by expelling Opt{G) from it. 

Lemma 9. 



| 0 | 

Opt{G) 



2 2 

< - - 1 + — 5 - 
p pw 



r — 1 
wr 



-d{G) 



These lemmas provide the following approximation factor of VCPM-Apx(w) 
for graphs with large matchings. 

Theorem 2. VCPM-Apx(w) achieves an approximation factor 



2 ^d + w 

2 ^ 

p rw{w + 1) 



for graphs of average degree d and having a matching of size at least pn/2. 
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Halldorsson and Radhakrishnan’s algorithm applied to graphs with large 
matchings achieves a factor For large p, VCPM-Apx(w) is better 

than theirs. For example, when p = 0.95 our algorithm outperforms theirs when 
d > 1.528, and reaches to 2 when d = 50. When p < 0.825, on the other hand, 
our algorithm becomes worse than theirs for all d, although in this case both 
can have factors of less than 2 only when d < 7. 
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Abstract. The problem of efficiently monitoring the network flow is 
regarded as the one to find out the minimum weighted weak vertex cover 
set for a given graph G = (V, E) with weight function w. In this paper, we 
show that a weak vertex cover set approximating a minimum one within 
2 — can be efficiently found in undirected graphs, and improve the 
previous work of approximation ratio within Ind + 1, where d is the 
maximum degree of the vertex in graph G. 



1 Introduction 

As internet growing rapidly, more and more network applications need to know 
the network flow information to monitor network utilization and performance. 
Most monitoring technologies typically assume that the measurement instrumen- 
tation can be either intelligently distributed at different points in the underlying 
network[l] or placed at the endpoints of the end-to-end path whose character- 
istics are of interest [2]. And now, more and more data must be collected by 
network monitoring process at much higher frequencies. Then the overhead that 
monitoring method imposes on the underlying router can be significant and ad- 
versely impact the router’s throughput and severely impact its performance. 

We describe the problem of efficiently monitoring the network flow as finding 
the minimum Flow Monitoring Set of a graph G. Formally, 

Definition 1 [3](Flow Monitoring Set) Given an undirected graph G=(V,E), 
where V denotes the set of nodes, E represents the edges between two nodes, we 
say S C V is a flow monitoring set of G, if monitoring the flow of those edges 
that are incident on nodes in S is sufficient to infer the flow of every edge in E. 
And the following two constraints must be satisfied: 

(1) \/v € V, Degreefv) > 2, where Degree{v) denotes the degree of node v 

(2) Vu G U. f{u,v) = 0, where f{u,v) denote the flow from node u to 

node V. 

* This work is supported by a grant from the Ministry of Science and Technology 
(grant no. 2001CCA03000), National Natural Science Fund (grant no. 60273045) 
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The minimum flow monitoring set is a flow monitoring set which contains 
minimum number of nodes of the graph. When the nodes of the graph are of non- 
negative weights, we call it the problem of weighted minimum flow monitoring 
set of graph G. 

Constraint (2) shows that the flow monitoring set problem satisfies flow con- 
servation. So, we can demonstrate that the problem of finding the minimum flow 
monitoring set from an underlying network gives rise to the problem of finding 
the minimum Weak Vertex Cover set [5] of a graph. In general, Weak Vertex 
Cover is a Vertex Cover Problem[ll] enriched with a linear system of equations 
for edge variables representing additional ’’knowledge” that can be used to re- 
duce the size of cover set. Hence we replace solving the problem of finding the 
minimum flow monitoring set by solving the problem of finding the minimum 
weak vertex cover set for a given graph. 

Definition 2 (Weak Vertex Cover) Given an undirected graph G = (V,E), 
where V u G V, deg{v) > 2 holds, we say S C V is a Weak Vertex Cover 
Set of G, if and only if every edge in G can he marked by performing the follow- 
ing three steps: 

( 1 )Mark all edges that are incident on vertices in S. 

(2) Mark the edge if it is the only unmarked edge among all of the edges which 
are incident on the same vertex. 

(3) Repeat step (2) until no new edge can he marked. 

We call it the weighted weak vertex cover problem when each vertex v 
has weight Wv associated with it. Hence the weighted weak vertex cover problem 
is to identify the smallest weight collection of vertices, and the vertex collection 
is a weak vertex cover set. 

For solving the problem of finding the minimum weak vertex cover set, [3] 
proves that S' C V is a weak vertex cover set of undirected graph G = (V, E) 
iff G' = {V',E') is a forest, where V = V — S and E' = {(u,u) | {u,v) G 
E()u G V Ov G V}, then brings forward a greedy approximation algorithm 
which gives an approximation ratio 2(ln<i -|- 1), where d = max„gy{ degree (u)}. 
And [4] proves that the weak vertex cover problem is JVV — hard, then gives 
an approximation algorithm with approximation ratio Ind-I- 1. Algorithm in [4] 
based on some property of cycle space in graph theory[12], and some of those 
properties are still used in this paper. 

The paper is structured as follows. The problem is bringed forward, and 
some basic definition and notation used throughout the paper are given in the 
Introduction section. We show a well character for cycle-rank-proportional graph 
in the next section, which provides a foundation for our algorithm. And in section 
3, we give an approximation algorithm to solve the minimum weighted weak 
vertex cover problem with approximation ratio of 2 — where Jz(G) = \E\ — 
\V\ 1. Then we bring forward our further research in the last section. 
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2 Well Character for Cycle-Rank-Proportional Graph 

Definition 3 In a weighted graph G = (V,E), a weight function w is Cycle- 
Rank- Proportional if for some constant c > 0, we have w{u) = c ■ for 
every u € V, where Ru is the number of decreased cycle rank of cycle space after 
deleting vertex u. 

We call the dimension of cycle space of graph G = (V, E) the cyclomatic 
number[12] of G, namely, i^{G) = \E\ — \V\ +Pq, where p^ is the number of 
connected components in G. Hence, 

Ru = HG) - jy{G - {u}) 

= {\E\ - \V\+p^) - {{\E\ - d{u)) - (|y| - 1) 

= d{u) - ku 

where ku is the increased number of connected components plus 1 after delet- 
ing u. 

fact 1 In Cycle- Rank- Proportional Graph, w{u) = c-{d{u) — ku) for every u € V . 

Definition 4 A cycle C = {vi,...,Vk} is a nearly -simple- cycle, if it has at 
most one vertex Vi, (1 < i < k), which incidents with other vertices not in the 
cycle. 

In this section, the weight function in graph we discussed is Cycle-Rank- 
Proportional, even more, we can look it as w{u) = d{u) — ku- And the graph we 
discussed in this section is connected and without nearly-simple-cycle. 

Lemma 1 For an arbitrary Weak Vertex Cover Set S of a connected graph 
G={V,E), w{S) > i^{G). 

Proof: After deleting all the vertices in S and edges incident with them, no cycle 
remains in the graph. The rank of cycle space in graph G is \E\ — \V\ + 1, where 
t is the number of connected components in G, in a connected graph, t = 1. 
Hence, 

w{S) = w{v) > \E\ - |V| -b 1 = i/(G) 
ves 



Lemma 2 If S is a minimal Weak Vertex Cover Set in a connected graph G = 
(V,E), where G contains no nearly-simple-cycle, then w{S) < 2 • {v{G) — 1). 

Proof: After deleting each vertex u in S, it will produce ku {ku > 1) components. 

Since S' is a weak vertex cover set in graph G, after deleting S and edges 
incident with vertices in S, the remained graph is a forest, say F, and F = {Ti}. 
Let S{Ti) denote the number of edges who connect tree Ti and S, where R e F. 
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Because S' is a minimal weak vertex cover set, any vertex in S has at least two 
edges connect to some tree in F. We call the collection of such vertices St that 
each of them is belongs to S and has at least two edges incident with tree T. 

Let p be a potential function in V, s.t. p{u) = d{u) — 2 for every u G V. And 
T be an arbitrary tree in F, assume that T has t vertices, then in this subgraph, 

p{T) = {d{u) - 2) = ^ d{u) -2t = 6{T) + 2{t -l)-2t = 5{T) - 2 

uGT uGT 

In this section, we only discuss the cycle-rank-proportional graphs, hence, any 
vertex in graphs is contained in one cycle at least. Therefore, for an arbitrary 
tree T in F, 6{T) > 2. In case St yf <!>, if 5{T) = 2, a nearly-simple-cycle exist, 
we will analysis it in the following section. Since St < each vertex in 

St can receive an extra potential of It is at least 1 if 6{T) > 3. 

Since any vertex in S must in some St, and remember the fact that > 1, 

we can get: 

= ^(d(u) - fc„) < - 1) < 5] (d{u) - 2) = 2\E\ - 2|y| 

uGS uGS uGV 

As the cyclomatic number iy{G) = \E\ — \ V\ + 1, therefore: 

w{S) < 2 ■ (i/(G) - 1) 



From the above two lemma, we can easily draw our conclusion as the following 
one. 

Theorem 1 S is any minimal Weak Vertex Cover Set of Cycle- Rank- Propor- 
tional graph C without nearly-simple-cycle, w is its weight function and opt{C, w) 
is the minimum Weak Vertex Cover Set in graph C, then: 

w{S) < (2 - -^) • w{opt{C,w)) 

3 Approximation Algorithm 

for Weighted Weak Vertex Cover 

For AfT^-hard optimization problems [11], we often design a polynomial time ap- 
proximation algorithm [6, 7] to get the solution which is close to the optimum 
solution. We call an algorithm for a minimum optimization problem has an ap- 
proximation ratio of p{n) if, for any input of size n, the cost C of the solution 
produced by the algorithm is within a factor of p{n) of the cost C* of an optimum 
solution: max(^) < p(n)[8j. 

In the first part of this section. We will first introduce the Local-Ratio The- 
orem[10], which was pioneered by Bar Yehuda and Even, who used it to develop 
an approximate graph vertex cover algorithm. Then we give our algorithm for 
weighted weak vertex cover problem by applying this theorem. 
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3.1 Introduction to Local-Ratio Theorem 

Theorem 2 [9] (Local-Ratio Scheme) If a feasible solution is r- approximation 
w.r.t a pair of weight functions W1 and W2 then it is also an r- approximation 
w.r.t Wl-hW2. 



We introduce the definition of decomposition before formalizing the above the- 
orem to the following one, hence we can apply it to our algorithm directly. 

Definition 5 For a weighted undirected graph G = (V, E), w is its weight func- 
tion, a set of weight functions {wi : i = 1, k} defined on V is a decomposi- 
tion of w if^iWi{u) < w{u) for each u GV. 



Theorem 3 [10] (Local Ratio Theorem) In a weighted graph G = (V,E), w is 
its weight function, let {wi, i = be a decomposition ofw, and S be any 

Weak Vertex Cover Set of G s.t. w{S) = '^iWi(S). Then, 



w{opt{G, w)) 



< max{ 



Wj{S) 

Wi{opt{G,Wi)) 



} 



where opt{G, w) denotes the optimum Weak Vertex Cover Set of G in weight 
function w. 



Proof: Since Wi is a decomposition of w, for any set U CV, w{U) > Wi{U). 
Besides, G{wi) is a subgraph of G{w) for every i, and if S' is a Weak Vertex Cover 
Set of graph G, then its restriction to any graph G' , where G' C G, is still a Weak 
Vertex Cover Set of G' . Therefore, we have Wi{opt{G,w)) > Wi{opt{G , Wi)) for 
all i. It follows now that 



w{S) 



< 



w^{s) 



< 



Wi(s) 



w{opt{G,w)) Y.iW^{opt{G,w)) Y.i^'i.{opt{G,Wi)) 

Wj(S) 



< max{ 



i Wi{opt{G,Wi)) 



} 



3.2 Description and Analysis of Algorithm 

From Local-Ratio Theorem, we can decompose the weight function w oi G into 
cycle-rank-proportional weight functions. And since the weight of any minimal 
weak vertex cover set is within 2--^^ of the optimum in cycle-rank-proportional 
graph, therefore, an approximation algorithm for weight weak vertex cover prob- 
lem with approximation of 2 — can be designed. 

The following algorithm WVC contains two main loops. The first loop de- 
compose the weight function w into {w^}, where every are cycle-rank-propor- 
tional for graph G. The second loop delete the redundant vertices step by step, 
hence, the remained vertices in any stage are minimal weak vertex cover set. 
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1 . 

2 . 

3 . 

4 . 



5 . 

6 . 

7 . 

8 . 

9 . 

10 . 

11 . 

12 . 

13 . 

14 . 

15 . 

16 . 

17 . 

18 . 

19 . 

20 . 
21 . 



WVC{G,w) 

find the nnmber k of connected components of graph G 
v{G) = \E\-\V\+k- S = cj> 
j = 0; Wj = w, s = u{G) 
while V (wj) 7 ^ (j> and s > 0 

/ / V{wj) denotes the largest subset of V, 

/ / each vertex u in it satisfying Wj (u) > 0 
if a nearly-simple-cycle (vi, ...,Vk) contained in graph G{V{wj)) 
then 

select the vertex Vi whose weight is minimal in {ui, ...,Vk} 
if V e {vi,...,Vk}, 

' ' 0 otherwise. 

else 

IuGHK)} 

// d(v,G(V(wj))) denotes the degree of v in graph G(V(wj)) 
w^(v) — A ■ (d(v, G(V (wj))) — kv) it v G V (wj) 

Wj+l = Wj — 

Sj = V{wj) - HK+i) 

find the nnmber fcj+i of connected components of graph G(H(wj+i)) 
s = |S(wj+i)| - \V{wj+i)\ + kj+i 

j + + 

for(t = j — 1 \ t >0 -,t ) 

for every u € St do 

if V{wt) — {S — u) indnces no cycle in G{V{wt)) 
then remove u from S 



fact 2 w(u) > w^(u) for every u GV . 

fact 3 Output S of algorithm WVC satisfies: w{S) = w^{S) 



From the above two fact and Local-Ratio-Theorem, we can get our conclu- 
sion, formally in the following theorem: 



Theorem 4 The approximation of algorithm WVC is 2 — . 

Proof: Each subgraph G' according to weight function wfi which belongs to 
the decomposition of weight function w, must either a nearly-simple-cycle, or a 
cycle-rank-proportional graph. Analysis in section 2 shows that the total weight 
of minimal weak vertex cover set of a cycle-rank-proportional graph is within 
2 — of the optimum. In the nearly-simple-cycle, the vertex we chosen in the 
algorithm is of the same weight with the optimum of the weighted subgraph. 

Therefore, the total weight of the algorithm’s output is within 2 — of 
the optimum. ■ 
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Theorem 5 The running time of algorithm WV C is 

0(|y|-(|S| + |V|)-min{KG),|V|}) 

Proof: Line 1 can be finished in 0{\E\ + | V|) by BTS algorithm. Line 2 and line 
3 run in constant time. Line 5 takes 0(|V|) since it maybe check all the vertex. 
Since w is a vector of rank \V\, line 6 to line 8 takes 0(|P|) time. Compute ky 
takes 0{\E\ + |y|) by BES algorithm, hence, line 11 can be finished in 0(|C| • 
(|if I + |y D) times. And line 12 to line 16 takes 0(|C| + | A|) time. Notice that the 
while loop works at most min{j/(G), |y|} times. Hence, the total running time 
of the while loop is G(|C| • {\E\ + |V|) • min{j/(G), |C|}). 

Since j < min{j/(G), |C|}, and line 18 to line 21 takes 0{\V\), the total 
running time of the for loop is G(|C| • min{j/(G), |C|}). 

Therefore, the total time of algorithm WVC is 

0{\V\-{\E\ + \V\)-mmMG),\V\}) 



4 Further Research 

In the paper, we give an approximation algorithm for the minimum weighted 
weak vertex cover problem, whose approximation ratio is 2 — , improve the 

previous work. Can we improve this approximation ratio into a better one? Or 
can we prove the inapproximation of this problem? Our research in the future will 
focus on these questions. And further, since the constraint of flow conservation 
equation only holds approximately, our research will focus on estimating the 
influence of the approximation error on the flow of edges calculated by this 
method, and try to give an algorithm solving the problem with approximation 
error. 
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Abstract. A cut {A, B) (where B = V — A) in a graph G{V, E) is 
called internal, iff there exists a node x in A which is not adjacent to 
any node in B and there exists a node y G B such that it is not adjacent 
to any node in A. In this paper, we present a theorem regarding the 
arrangement of cliques in a chordal graph with respect to its internal 
cuts. Our main result is that given any internal cut {A, B) in a chordal 
graph G, there exists a clique with k(G) + 1 nodes (where k(G) is the 
vertex connectivity of G) such that it is (approximately) bisected by the 
cut (A, B). In fact we give a stronger result: For any internal cut {A, B) 
of a chordal graph, for each i, 0 < i < k(G) + 1, there exists a clique Ki 
such that \Ki\ = k(G) + 1, |A n Ki\ = i and \B n Ki\ = k(G) + 1 — i. 
An immediate corollary of the above result is that the number of edges in 
any internal cut (of a chordal graph) should be where k(G) = k. 

Prompted by this observation, we investigate the size of internal cuts in 
terms of the vertex connectivity of the chordal graphs. As a corollary, we 
show that in chordal graphs, if the edge connectivity is strictly less than 
the minimum degree, then the size of the mincut is at least ^ 

where k(G) denotes the vertex connectivity. In contrast, in a general 
graph the size of the mincut can be equal to k{G). This result is tight. 



1 Introduction 

Let C be a cycle in a graph G. A chord of C is an edge of G joining two 
vertices of C which are not consecutive. A graph G is called a chordal graph iff 
every cycle in G of length 4 or more has a chord. Chordal graphs arise in many 
applications (see [7,10,15]), which includes the study of evolutionary trees [1], 
facility location [5], scheduling problems [11], and the solution of sparse systems 
of linear equations [12, 13]. Chordal graphs constitute one of the most important 
subclasses of perfect graphs [7]. The various names given to chordal graphs, 
such as triangulations, rigid circuit graphs, perfect elimination graphs, monotone 
transitive graphs and so on illustrate the importance of these graphs. 
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Let G = (y, E) be a connected undirected graph. Throughout this paper we 
will use V for the set of vertices of G and E for the set of edges. \V\ will be 
denoted by n. N{v) will denote the set of neighbours of v, that is N{v) = {u G 
V : {u, v) G E}. For AQV,^e use N{A) to denote the set IJweA -^(^) ~ The 
subgraph of G induced by the nodes in A will be denoted by G(A). 

The vertex connectivity k{G) of an undirected graph is defined to be the 
minimum number of vertices whose removal results in a disconnected graph or a 
trivial graph (ie, single node). A subset S' C y is called a separator if G{V—S) has 
at least two components. Then, k{G) is the size of the minimum sized separator. 
Note that a complete graph has no separator and its (vertex) connectivity is by 
definition n — 1. 

A cut (A,V — A) where % ^ Ad V is defined as the set of edges with exactly 
one end point in A. That is, (A, V — A) = {{u,v) € E : u € A and v GV — A}. 
In this paper we always use {A, B) instead of (A, V — A) for notational ease, i.e., 
B always means V — A. 

Definition 1. A node x G A is called a hidden node with respect to B (or with 
respect to the cut (A,B)), iff x ^ N{B)- 

Definition 2. A cut (A, B) is called an internal cut ofG iff there exists a hidden 
node x G A and a hidden node y G B. Equivalently, a cut {A, B) is internal iff 
N{A) yf B and N{B) yf A, ie, N{A) is a proper subset of B and N{B) is a 
proper subset of A. 

Note that if a; is a hidden node in A, then {x} (J A^(a:) C A. In fact, the only 
condition for (A, B) to be an internal cut is that there should exist a node x G A 
and y G B such that N{x) C A and N{y) C B. Thus if we can find two non 
adjacent nodes x and y, such that N{x)f]N{y) = 0, then there exist internal 
cuts in the graph: we just have to include {x} (J N{x) in A and {y} \JN{y) in B 
and divide the remaining nodes arbitrarily. The reader can easily verify that this 
can be done in U U ways. Thus in general, the number of 

internal cuts in a graph can be exponential. 

Our results: We present a structural theorem about internal cuts in chordal 
graphs. Let (A, B) be an internal cut in a chordal graph G with connectivity 
k{G) = k. Let z be a number such that 0 < z < A: + 1. Then we prove that there 
exists a clique Ki in G, such that \Ki\ = fc + 1, |ATifjA| = i and \Kiff\B\ = 
fc + 1 — z. (Theorem 3). 

An interesting special case of the above result is when z = . Then, 

the theorem states that, irrespective of which internal cut (A, B) we consider, 
it is always possible to find a clique of size k + 1, such that it is bisected by 
(A,B). We came up with this problem while trying to prove a theorem about 
the treewidth of a certain class of graphs which happened to have reasonably 
high connectivity. Note that for any graph G, treewidth(G) + 1 is the same as 
the minimum possible clique number (i.e., the number of nodes in the maximum 
clique) over all possible chordal graphs G' such that G' is a super graph of G. In 
order to get the desired result, the approach we followed required that we should 
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be able to find some cut {A, B) in the chordal completion of G, such that |A| 
and \B\ are more or less of the same size. Moreover, we wanted to ensure that 
no “big cliques” in the graph is bisected by the cut, or at least that given any 
big clique, the majority of the nodes in this clique belonged to one of the sides. 
But irrespective of however we increased the assumptions on the structure of the 
graph classes we were studying, we were unable to find nice chordal completions 
with such nice cuts. Working with small example graphs, we realised that when 
we try to “adjust” a cut (by moving certain nodes from AtoB and so on) such 
that a certain clique is not bisected, invariably some other clique gets bisected. 
This led us to exploring the arrangement of cliques with respect to cuts in chordal 
graph. 

As of now, we are unable to present any specific applications for our theorem. 
But we would like to point out that the chordal completions of graphs are of 
great interest from the point of view of two important optimisation problems: 
the treewidth and the fill-in problem. As mentioned in the previous paragraph, 
given a graph G, the treewidth problem is to find a chordal completion of G, 
so as to minimise the clique number. The fill-in problem seeks to find a chordal 
completion of G, which minimises the number of edges added. Also note that, 
properties such as connectivity are monotonic: i.e., as new edges are added, the 
property never decreases. Thus, the value of the property in the chordal comple- 
tion will be at least as much as in the original graph. Structural understanding 
of chordal graphs, with respect to such properties turn out to be useful, in mak- 
ing apriori inferences about the optimal chordal completions. (For example, see 
[4], where lower bounds for treewidth and pathwidth are derived in terms of a 
generalised connectivity property.) 

Moreover, chordal graphs are well-studied from various perspectives. For 
example, the vertex separators in chordal graphs are well understood. But the 
corresponding concept of edge separator, namely the cuts (due to some reason) 
are very little studied. In this paper, we attempt to understand chordal graphs, 
from the perspective of edge cuts. 

As mentioned before, the number of internal cuts can be exponential, in 
general. Then does the fact that every internal cut “cuts through” many cliques 
of size fc -|- 1 mean that always there exist a large number of cliques in a chordal 
graph? Not necessarily. It is well known that the number of maximal cliques 
(which can sometimes contain as small as A: -1-1 nodes) in a chordal graph is upper 
bounded by n. (See [7].) It is just that these few cliques, arrange themselves in 
such a way, that it is possible for every internal cut to “cut through” a few of 
them. Thus the result indeed throws some light on the arrangement of cliques 
with respect to cuts in a chordal graph. 

Another result inspired by the above theorem: An immediate consequence 
of the result (Theorem 3) is that the size of every internal cut has to be I?(k(G)^). 
This is because for i = internal cut would be approximately “bi- 

secting” a clique Ki with (k(G) -I- 1) nodes. After observing this, we investigated 
whether a better result is possible. In Theorem 4 we prove a lower bound for the 
size of internal cuts in chordal graphs, namely AG){AG)+i) ^ jg 130 ^^ 0 ]- than 
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what can be immediately infered from Theorem 3. We also show that this result 
is strict in the sense that there exist chordal graphs with vertex connectivity 
k{G) and having internal cuts with exactly 

The edge connectivity A(G) of a graph G is defined to be the minimum 
number of edges whose removal results in a disconnected graph or a trivial 
graph. A minimum cut is a cut consisting of A(G) edges. 

It is well known that the following inequality (due to Whitney [14] ) holds in 
a general undirected graph: 

k(G) < A(G) < ,5(G) 

where 6{G) is the minimum degree of the graph. For a discussion of this inequality 
and for related work see the chapter on connectivity in [8] . In [6] , Chartrand and 
Harary show that for all integers a, b, c, such that 0 < a < b < c, there exists a 
graph G with k{G) = a, A(G) = b and S{G) = c. 

We study this inequality when restricted to the class of chordal graphs. We 
observe that when A(G) < S{G), every mincut in G is also an internal cut. Thus, 
as a corollary of our result on internal cuts in chordal graphs, we show that there 
is a “gap” between A(G) and k(G) in chordal graphs provided A(G) yf 5{G). More 
specifically, if A(G) yf S{G), then A(G) > ''^iG){K(G)+i) ^ improves the result 
in [2] which shows that A(G) > 2n{G) — 1, when A(G) < 5{G). And the lower 
bound we obtain here is tight. 

1.1 Preliminaries 

The following result called Menger’s theorem is well known among graph theo- 
rists. An elegant proof due to Dirac is given in [8]. 

Theorem 1. /S/Menger’s theorem: The minimum number of vertices sepa- 
rating two nonadjacent nodes s and t is equal to the maximum number of vertex 
disjoint s — t paths. 

Thus, there are at least k{G) vertex disjoint paths between every pair of nonad- 
jacent nodes of G. 

A bijection / : P ^ {1, 2, • • • , n} is called an ordering of the vertices of G. 
Then f(v) is referred to as the number associated with the vertex v, or simply 
the number of v with respect to the ordering /. Given an ordering / of a graph 
G, we define the following terms. 

Definition 3. Let Af-V. The highest(A) is defined to be the vertex with the 
highest number in A. 

Definition 4. A path P = (wi,W 2 , • • • , Wk) in G is called an increasing path, iff 
f{wi) < f{w 2 ) < ■ ■ • < f{wk). It is called a decreasing path iff f{w\) > f{w 2 ) > 

• • • > f{wk). A single node can be considered as either increasing or decreasing. 

Definition 5. A vertex u G N{v) is called a higher neighbour of v iff f{u) > 
f{v). The set of higher neighbours of v will be denoted by Nh{v), ie 

Nh{v) = {ue N{v) : f{u) > f{v)} 
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Definition 6. An ordering f ofG is called a perfect elimination ordering (PEO) 
iff for each v GV, U ® complete subgraph (clique) of G. Then 

f{v) will be called the PEO number of v. 

Note that every graph may not have a PEO. The following theorem is well 
known (see [7]). 

Theorem 2. [7] An undirected graph G is chordal if and only if there exists a 
PEO for G. 

A node v is called simplicial iff N{v) induces a clique. The following obser- 
vation is due to Dirac: 

Lemma 1. [7] In every chordal graph, which is not a complete graph, there 
exists at least two non adjacent simplicial nodes. (In a complete graph, every 
node is simplicial). 

From Lemma 1, it is easy to make the following useful observation. (See for 
example, [7], page 84). 

Lemma 2. Suppose v is any node in a chordal graph G. Then there exists a 
PEO f for G such that f{v) = n. 

In fact more can be inferred. (See also, [9], page 8). Let K he a, clique in a 
chordal graph G. Then from Lemma l,it can be inferred that if V{G) — Lf yf 0, 
there should be a simplicial vertex xi G V{G) — K. Let f{xx) = 1. Now remove 
x\ from G. Clearly, the induced subgraph on the remaining nodes (say G') is 
also chordal and thus Lemma 1 is applicable to G' also. Then we can find a 
simplicial vertex X 2 GV (G") — K, which is simplicial (as long as V (G') — K is 
nonempty). Let f{x 2 ) = 2. By repeating this procedure, one can get a PEO / 
for G such that the nodes of K gets the highest numbers, namely the numbers 
from n — |AT| -I- 1 to n. We state this useful observation as a lemma. 

Lemma 3. Let K be any clique in a chordal graph G. Then there exists a PEO 
f of G such that the highest \K\ numbers are assigned to the nodes of K. In 
other words, n — \K\ + \ < f {x) < n, for each node x G K . 

Another well known characterisation of chordal graphs is in terms of minimal 
separators. (A separator S of two nodes x and y is said to be a minimal separator 
if no proper subset of S can separate x from y. Note that every separator of x 
and y should contain a minimal separator of x and y) . 

Lemma 4. A graph G is a chordal graph if and only if every minimal separator 
of G induces a complete subgraph. 

A chordless path from u to n is defined to be a path from u to w in G such 
that no two non consecutive nodes of the path are adjacent. The reader can 
easily verify that if there is a path between u and v then there is a chordless 
path also. For example, the shortest path between u and v has to be a chordless 
path. The following observations from [3] are useful tools. 
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Lemma 5. [3] Let P = (wi,W 2 , ■ • • ,Wk) be a chordless path in a chordal graph 
G and let Wi = highest(P) , with respect to a PEO f. Then (wi,W 2 , • • • , Wi) is an 
increasing path while {wi, Wi+i, ■ ■ ■ , Wk) is a decreasing path, with respect to f . 

Corollary 1. Let P = {w\,W 2 , • • • , Wk) be a chordless path and Wk=highest{P) 
with respect to a PEO f. Then P is an increasing path with respect to f. 

2 Internal Cnts in Chordal Graphs 

Theorem 3. Let {A, B) be an internal cut of a chordal graph G with k{G) = k. 
Then for each i, 0 < i < (k+1), there exists a clique Ki such that \Ki\ = k+1, 
Pi = i and \B f] Ki\ = k + 1 — i. 

Proof: Let x G A and y G B he hidden nodes with respect to the cut (A,B). 
(Since {A, B) is an internal cut, such nodes exists in A and B). Clearly x and y 
are nonadjacent. Then by Menger’s Theorem (Theorem 1), there exist n{G) = k 
vertex disjoint paths from x to y. If there are k vertex disjoint paths, then there 
are k vertex disjoint chordless paths also. Let these vertex disjoint chordless 
paths be P{, ; Pk- 

Clearly, N{A) separates x from y. Thus there exists a minimal separator 
M C N{A) C B which separates x from y. By Lemma 4, M is a clique. Then by 
Lemma 3, there exists a PEO such that the highest numbers are assigned to the 
nodes of M. Let / be such a PEO. Note that each path P/, \ < i < k, should 
pass through M, since M is a separator of x and y. Let Zi be the first node 
at which P/ meets M. Let Pi be the partial path from x to Zi of P/. Clearly, 
Zi = highest(Pi) (with respect to the PEO /), since Zi is the only node in Pi, 
that belongs to M, and the PEO / was selected such that the nodes in M got 
the highest numbers. Thus by Corollary 1, Pi is an increasing path. 

Now we define a series of k+1 element subsets of V (G), namely Pq, Pi, • • • , P^ 
as follows. Let Pq = {x,yi,y 2 , - ■ ■ ,yk} where yi is the first node in Pi after x. 
Note that each yi G A, since a: is a hidden node in A. Clearly each yi has 
a higher PEO number than that of x, since Pi is an increasing path. Thus 
{yi, 2 / 2 , • • • , 2/fc} C Nh{x). Thus Po = {a;, 2/i, 2 / 2 , • • • , 2/fc} induces a clique of size 
fc + 1, which is completely in A. 

Now we will describe how to define P^+i from P^, for 0 < i < r. Let Vi = 
lowest{Fi). Define El = Fi — {ui}. Suppose that the subset Fi satisfies the 
following properties. 

1. \F,\ = k + l. 

2. The nodes of Fi induces a clique. 

3. Fi C Ui=i Moreover, \F'(^Pj \ = 1 for each path Pj, 1 < j < k. (That 
is, in Pj', there is exactly one “representative” node from each path Pj). 

Now let Ui = lowest(F') = second-lowest (Pi). Suppose that Ui is on path Pj. 
Now we define Pi+i from Pi as follows. 

If Ui = Zj, the last vertex on path Pj, then let Fi = F^. That is. Pi will be the 
last subset in the series, in that case. Otherwise, consider the node Wi, which is 
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placed just after m on the path Pj. Since Pj is an increasing path (with respect 
to /)? /(tt'i) > f{ui). Also note that Wi ^ Fi. To see this, note that Wi ^ F/, since 
\Fl f]Pj \ = l,and Ui is already there in {F[ f]Pj). Also, Wi ^ Vi = lowest{Fi), 
since f{wi) > f{ui) > f{vi). Thus Wi is a node on Pj which is not already in Fi. 
We define Fj+i as follows. 



Fi+i = Fi- {uj} |J{wJ 

Note that Fi+i is obtained by replacing the node Vi, the lowest numbered 
node in Fi with a new node Wi. Thus |Fi+i| = k + \. Clearly lowest{Fi+i) = 
Ui- Also it is clear that since Fi was assumed to induce a clique and Wi is a 
(higher) neighbour of Ui, (Fi+i — {wi}) C Nh{ui), with respect to /. Thus 
induces a clique also. Clearly Fi+i C Uj=i Fj- Note that F[_^^ = Fi+i — {m} = 
F' — other words, is obtained by replacing Ui in F/ by 

Wj. Remember that |F/fjFj| = 1, for each j. Since we are just replacing the 
“representative” node Ui of Pj (in F/) by another node Wi (which is also on the 
same path Pj), to obtain the new set F/^_;^, it follows that |F/^_^ C\Fj\ = 
each Pj, 1 < j < k. Thus the three conditions satisfied by Fj are satisfied by 
Fi+i also. 

From the inductive argument above, together with the fact that Fq satisfies 
all the 3 properties stated above, it follows that each subset Fi in the sequence 
defined above satisfies those 3 properties. 

Now we look at the last subset F^ in this sequence. Let Ur = lowest{F!^). 
By definition of the sequence, Ur = Zj, (ie, the last vertex of Pj), for some path 
Pj. We claim that F^. = {zi,Z 2 , - ■ ■ ,Zk}- Suppose not. Then, since F^ should 
contain exactly k nodes, and F^ C Uj=i Fj, there should be a node Xr G F^., 
which is on some path Pi , but not an end vertex of Pi , \e Xr ^ zi. Since we had 
selected z; to be the first node at which Pi meets M, xi ^ M. But the PEO / 
was selected such that the nodes of M got the highest numbers. It follows that 
f{xi) < f{zj), contradicting the assumption that Zj = lowest(F^) . Thus we infer 
that F^ = {zi,Z 2 ,---,Zk}. 

Clearly |FofjA| = k + I, and |Frp|F| > k, since F^ = {zi, Z 2 , ■ ■ ■ , Zk} Q 
M C N{A) C B. Noting that at each step, we were replacing one node in Fj 
with a new one to get Fj+i, it follows that for each value i, 1 < i < k + 1, there 
should be a Fj. such that |Fj. fj A| = i, and |Fj. fj F| = A: + 1 — z. Let Ki be the 
clique induced by Fj. . Clearly Ki has the desired property. Now, to see that there 
exists a clique Kq such that \Kq\ = k + 1, \Kq fj A| = 0 and \Kq fj F| = A: + 1, 
just interchange the role of x and y and observe that just as there is a clique 
induced by Fg, comprising of x and its (higher) neigbours {j/i, ?/ 2 , • • • , 2/fc}, which 
is completely in A, there exists a clique which is completely in B comprising of 
y and its higher neighbours. Hence the proof. ■ 

As our next result we show that the number of edges in an internal cut of a 
chordal graph is at least AG){k,(G)+i) ^ 

Theorem 4. Let {A, B) be an internal cut of a chordal graph G with k{G) = k. 
Then \{A,B)\ > 
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Proof: For k{G) = 1, the lemma is trivially true because the graph is connected 
and any cut will have at least one edge in it. So we consider the case when 
k{G) > 2. Let {A^ B) be an internal cut of G. By the definition of internal cuts, 
there exist hidden vertices x £ A and y G B. Since G is a fc-connected graph, by 
Menger’s theorem (Theorem 1) there are k vertex disjoint paths between x and 
y. Let P{, ■ ■ ■ ,P^ denote these paths. For each P[, 1 < i < k, there is a chordless 
path between x and y whose vertices are a subset of the vertices of P/. As x and 
y are hidden vertices, it follows that each chordless path has at least 4 vertices 
in it. Let these chordless paths be P\, . . . ,Pk- Clearly, these are vertex disjoint 
paths. 

Let (mi, m), . . . , {uk, Vk) be the edges from Pi,. .. ,Pk, respectively, such that 
for 1 < t < fc. Mi G A and Vi G B. An edge is said to cross a cut if its end 
points are not on the same side of the cut. The theorem is proved by showing 
that for every pair of paths {Pi, P^}, 1 < i yf j < k, there is an edge Cij crossing 
the cut (A,B), one of whose end points is on Pi — {x,y}, and the other on 
Pj — {x, y}. As Pi, . . . , Pfc are vertex disjoint paths between x and y, an edge 
cannot be associated with two distinct pairs of paths. This counts (2) edges 
crossing the cut, which when considered along with the edges {ui,Vi\,l < i < k, 
yields |(A,P)| > = «(g)(«4G)+i) ^ 

Let Pi,Pj be two distinct paths from the set |Pi,...,Pfc}. We claim that 
there is an edge Cij crossing the cut (A,B), one of whose end points is on 
Pi — {x, y}, and the other on Pj — {a;, y}. Let us assume the contrary and prove 
our claim by arriving at a contradiction. Let wi be the last vertex in A on Pi 
which has an edge to a vertex on Pj. (Note that wi yf x, since the node appearing 
just after x, on Pi, itself has a neighbour on Pj, namely x). By our assumption 
that no chord starting from a node on Pi and ending at a node on Pj crosses 
the cut, N{wi) n Pj C A. In Pi, on the path from wi to y, let W2 be the first 
vertex in B that has an edge to a vertex on Pj. Clearly, W2 ^ y and by our 
assumption, N{w2) H Pj C B. Let W4 and W3 be the closest vertices on Pj such 
that W4 G N{wi) n Pj and W3 G N{w2) H Pj. Clearly, the portion of the path Pj 
from W3 to W4 will not contain any other vertices of N(wi) or N(w2). Now, the 
path from wi to W2, the edge (102,103), the path from 103 to 104, and the edge 
(104, wi) form an induced cycle of length at least 4. (This easily follows from the 
construction and the assumption that Pi and Pj are chordless paths). This is 
clearly a contradiction since G is a chordal graph. Therefore, our assumption is 
wrong and there exists an edge e^- which crosses the cut (A,B), with one end 
point on Pi and the other end point on Pj. Finally, such an edge cannot have 
a; or y as one of its end points, since that would violate the assumption that Pi 
and Pj are chordless. Hence the theorem. ■ 

The above lower bound for the size of an internal cut of a chordal graph can 
be tight. Consider the following graph for instance. 



Example 1: Construct a graph G as follows. Let A be a clique of Ni nodes 
and B a clique of N2 nodes. Let u\, U2, ■ ■ ■ ,Uk he k nodes in A and vi,V2, • ■ ■ ,Vk 
be k nodes in B. Add an edge from (1 < t < A:), to each node in the set 
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{ui, Ui+i, ■ ■ ■ , Uk}- Thus, vi is connected to {tti, U 2 , ■ ■ ■ , Uk}, V 2 is connected to 
{u 2 ,M 3 , • • • ,Uk} etc. 

To verify that the above graph is a chordal graph, we describe a PEO / for 
this graph. Let the last N\ numbers (i.e., the numbers from A ^2 + l to A ^2 + .^i) be 
given to the nodes of A in some order. Now let fiyi) = N 2 — i+1, for 1 < i < k. 
Let the remaining nodes in B get the numbers from 1 to N 2 — k in any order. 
It is easy to verify that this ordering is a PEO. It follows that G is a chordal 
graph. (See Theorem 2). 

Suppose that Ni,N 2 > k. Then it is easy to verify that k{G) = k. {A,B) 
is an internal cut since A contains nodes which are not adjacent to any node 
in B and B contains nodes which are not adjacent to any node in A. Clearly 
\{A,B)\ = k + k-l + --- + l= m 

3 Edge Connectivity vs. Vertex Connectivity 
in Chordal Graphs 

In this section we show that if for a chordal graph G, A(G) < S{G), then A(G) > 
where k{G) = k. 

Lemma 6. Let (A, B) be a mineut of a eonnected undirected graph G. Then 
G{A) and G{B) are connected. 

Proof: Suppose that G{A) is not connected. Let G{Ai) be a connected com- 
ponent of G{A), where Ai c A. Let A 2 = A — Ai. Clearly {Ai,B IJ A 2 ) is a cut 
of G and \{Ai, B[_f A 2 )\ < \{A,B)\, since {Ai,B[^A 2 ) C \a,B). But this is a 
contradiction since (A,B) is assumed to be the mineut. ■ 

Lemma 7. Let (A,B) be a mineut of G, and let A(G) = \{A,B)\ < 6{G). Then 
there exists a node x & A, such that x ^ N{B). Similarly 3y G B such that 
y ^ N{A). That is {A,B) is an internal cut. 

Proof: Suppose that every node in A is adjacent to some node in B. Let u be 
a node in A. Let F = (A, B), be the mininum cut. We have 

d{u) = |N(m) f| + \N{u) f]A\< |N(m) Pi S| + |A| - 1 

But 

\F\ = ^ |lV(a;) p s| > \N{u) p S| 4 - |A| - 1 

x&A 

since each term in the sum should be at least 1 by the assumption that every 
node in A is adjacent to at least one node in B. It follows that d{u) < |E|. 
But by assumption, d{u) > S{G) > 1^1. Thus we have a contradiction and we 
conclude that there is a node x G A such that x ^ N{B). By similar arguments, 
3y G B such that y ^ N{A). ■ 

Theorem 5. For a chordal graph G, if X{G) yf 5{G), then A(G) > 
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Proof: By Lemma 7, when A(G) < S(G), the mincut will be an internal cut. 
Then the result follows from Theorem 4 ■ 

The above theorem is strict. Consider the graph G in Example 1. If fVi, A ^2 > 
then it is clear that S(G) > = \[A,B)\ > A(G). In this case it is 

easy to verify that {A,B) is in fact the mincut. Thus A(G) = ^ 

this graph. 
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Abstract. We present a depth-first search algorithm for generating all 
maximal cliques of an undirected graph, in which pruning methods are 
employed as in Bron and Kerbosch’s algorithm. All maximal cliques gen- 
erated are output in a tree-like form. Then we prove that its worst-case 
time complexity is 0(3"^®) for an n-vertex graph. This is optimal as a 
function of n, since there exist up to 3"^® cliques in an n-vertex graph. 

1 Introduction 

In an undirected graph G, a clique is a complete subgraph of G in which any 
two vertices are adjacent. The set of vertices of a maximal clique of the com- 
plementary graph of G is a maximal independent set of G. Generating maximal 
cliques or maximal independent sets of a given graph is one of the fundamental 
problems in the theory of graphs and has many diverse applications, e.g., for 
clustering and for bioinformatics [10], [7]. A number of algorithms have been 
presented and evaluated experimentally or theoretically for this problem; see, 
e.g., [5], [14], [9], [6] and [4], where [4] is an elaborate survey paper on the 
maximum clique problem. Among them, Bron and Kerbosch [5] presented an 
algorithm for generating all maximal cliques of a graph together with some ex- 
perimental results. Tsukiyama et al. [14] devised an algorithm for generating all 
maximal independent sets in a graph G in 0(nm/r)-time, where n, to, and p are 
the numbers of vertices, edges, and maximal independent sets of G, respectively. 
Lawler et al. [9] generalized this result further. Chiba and Nishizeki [6] improved 
Tsukiyama et al.’s algorithm and gave a more efficient algorithm for listing all 
maximal cliques of G in 0{a{G)m p)-time, where a(G) is the arboricity of G 
with a(G) < for a connected graph G, and p is the number of maximal 

cliques in G. 
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We present here a depth-first search algorithm for generating all maximal 
cliques of an undirected graph, in which pruning methods are employed as in 
Bron and Kerbosch’s algorithm [5]. All the cliques generated are output in a 
tree-like form. Then we prove that its worst-case running time complexity is 
0(3"/^) for a graph with n vertices. This is the best one could hope for as a 
function of n, since there exist up to 3”/^ maximal cliques in a graph with n 
vertices as shown by Moon and Moser [11]. An earlier version of this paper 
appeared in [13]. 

2 Preliminaries 

[1] Throughout this paper, we are concerned with a simple undirected graph 
G = (V,E) with a finite set V of vertices and a finite set E of unordered pairs 
(u, w) of distinct vertices, called edges. A pair of vertices v and w are said to be 
adjacent if (v,w) G E. We call G = (V,E) with E = {(u,w) GVy.V\v^w, 
and (v,w) ^ E} the complementary graph of G. 

[2] For a vertex v G V, let E{v) be the set of all vertices which are adjacent to 
in G = (V, E), i.e., E{v) = {w G V \ (v,w) G E} { ^ v ). 

[3] For a subset W C V oi vertices, G{W) = {W,E{W)) with E{W) = {(f, w) G 
W xW \ {v,w) G E} is called a subgraph of G = (M, E) induced by W. For a set 
W of vertices, j W j denotes the number of elements in W. 

[4] Given a subset Q C V of vertices, the induced subgraph G(Q) is said to 
be complete if (v,w) G E for all v,w G Q with v ^ w. If this is the case, we 
may simply say that Q is a complete subgraph. A complete subgraph is also 
called a clique. If a clique is not a proper subgraph of another clique then it is 
called a maximal clique. A subset W CV of vertices is said to be independent if 
{v, w) ^ E for all v,w G W. Here, Q CV is a maximal clique of G if and only if 
Q is a maximal independent set of the complementary graph G. 

3 The Algorithm 

We consider a depth-first search algorithm for generating all the cliques of a 
given graph G = (V,E) {V ^ (j). 

Here we introduce a global variable Q of a set of vertices which constitute a 
complete subgraph being found up to this time. Then we begin the algorithm 
by letting Q := (j), and expand it step by step by applying a recursive procedure 
EXPAND to V and its succeeding induced subgraphs searching for larger and 
larger complete subgraphs until they reach maximal ones. 

Let Q = {pi,P 2 , ■ ■ ■ ,Pd} be found to be a complete subgraph at some stage, 
and consider a subgraph G{SUBG) which is induced by a set of vertices 

SUBG = vc F(pi) n r{p 2 ) n • • • n r{pd), 

where SUBG = V and Q = (j) at the initial stage. Then apply the procedure 
EXPAND to SUBG searching for larger complete subgraphs. If SUBG = (j), 
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then Q is clearly a maximal complete subgraph, or, a maximal clique. Otherwise, 
QU{q} is a larger complete subgraph for every q G SUBG. Then consider smaller 
subgraphs G{SUBGq) which are induced by new sets of vertices 

SUBGq = SUBG n r{q) 

for all q G SUBG, and apply recursively the same procedure EXPAND to SUBGq 
to find larger complete subgraphs containing QU {g}. 

Thus far we have shown only the basic framework of the algorithm for gen- 
erating all maximal cliques (with possible duplications). This process can be 
represented by the following search forest, or the collection of search trees: The 
set of roots of the search forest is exactly the same as V of graph G = {V,E). 
For each q G SUBG, all vertices in SUBGq = SUBG n T(g) are sons of q. Thus, 
a set of vertices along a path from the root to any vertex of the search forest 
constitutes a complete subgraph, or a clique. We shall show an example of a 
search forest (with unnecessary subtrees deleted) later in Fig. 2 (b). 

Now we proceed to describe two methods to prune unnecessary parts of 
the search forest. We regard the previously described set SUBG (yf (j)) to be 
an ordered set of vertices, and we continue to generate maximal cliques from 
vertices in SUBG step by step in this order . 

First, let FINI be the first half of SUBG and suppose that we have already 
finished expanding search subtrees from every vertex q' G FINI C SUBG to 
generate all maximal cliques containig Q U {g'} , and that only the remaining 
vertex g G CAND = SUBG — FINI is a candidate for further expansion of the 
present complete subgraph Q to generate new larger cliques. That is, let 

SUBG = FINI U GAND {FINI n CAND = fi) , 



with 



FINI = {q' G SUBG \ The procedure has finished generating 

all maximal cliques containing Q U {g^}}. 

Consider the subgraph G{SUBGq) with SUBGq = SUBG n F{q), and let 

SUBGq = FINIq U GANDq {FINIq n CANDq = fi), 

where 

FINIq = FINI n F{q), and GANDq = CAND n F{q). 

Then only the vertices in the subgraph G{CANDq) can be candidates for ex- 
panding the complete subgraph Q U {g} to find new larger cliques, since all 
the cliques containing {Q U {g}) U {r} with r G FINIq C FINI have already 
been generated for any r by application of the procedure EXPAND to FINI 
as stated above. Thus, further expansion is to be considered only for vertices in 
G{CANDq) excluding ones in FINIq = SUBGq — GANDq. 

Secondly, given a certain vertex u G SUBG, suppose that all the maximal 
cliques containing QU {u} have been generated. Then every new maximal clique 
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containing Q, but not Q U {m}, should contain at least one vertex v G SUBG — 
r{u). This is because if Q is expanded to a complete subgraph R C (Q U S') n 
{SUBG — {m}) with S C r{u) , then R U {m} is a larger complete subgraph, 
and hence R is not maximal. Thus, any new maximal clique can be found by 
expanding Q to Q U {g} such that q G SUBG — U{u), and by generating all 
the cliques containing Q U {g}. Therefore, if we have expanded a search subtree 
from u G SUBG, then we should not expand any search subtree from w G 
SUBGnr{u). 

Taking the previous pruning method also into consideration, the only search 
subtrees to be expanded are from vertices in {SUBG — SUBG H r{u)) — FINI = 
GAND — r{u) (9 u). Here, in order to minimize | CAND — U{u) |, we choose 
such vertex u G SUBG to be the one which maximizes | GAND n B{u) |. In 
this way, the problem of generating all maximal cliques of G{GAND) can be 
decomposed into k =\ CAND — D{u) \ such subproblems; see Lemma 1 (i) in 
Section 4. 

Now we present an algorithm CLIQUES for generating all maximal cliques 
without duplications in Fig. 1. 

If and only if Q is found to be a maximal clique at statement 2, we only print 
out a string of characters “clique,” instead of Q itself at statement 3. Otherwise, 
it is impossible to achieve the worst-case running time of 0(3”/^) for an n- 
vertex graph, since printing out Q itself requires time proportional to the size of 
Q. Instead, in addition to statement 3, not only do we print out q followed by 
a comma at statement 7 every time q is picked out as a new element of a larger 
clique, but we also print out a string of characters “hack,” at statement 12 after 
q is moved from CAND to FINI at statement 11. We can easily obtain a tree 
representation of all the maximal cliques from the resulting sequence printed by 
statements 3, 7, and 12. Here, primed statements O', 7', and 12' are only for the 
sake of explanation. 

Example. Let us apply the above algorithm CLIQUES to a graph in Fig. 2 (a). 
Then the whole process is represented by a search forest in Fig. 2 (b), and we 
show the resulting printed sequence in Fig. 2 (c) with appropriate indentations. 
In Fig. 2 (b), each set of vertices surrounded by a flat circle represents SUBG at 
that stage, in which vertex with A mark is in FINI C SUBG at the beginning. 
Vertex u chosen at statement 4 is marked by El or Q depending on whether it is 
in CAND or FINI, respectively. Other vertices in CAND — F{u) are marked 
by o, while vertices in CAND n F{u) are marked by •. In conclusion, all the 
maximal cliques of G are {4, 6, 7, 8}, {4, 6, 5}, {4, 3, 8}, {1, 2, 9}, and {2, 3, 9}. □ 

Now given only the resulting printed sequence in Fig. 2 (c) without indenta- 
tions, we can easily obtain essentially the same result as above by reconstructing 
from it a tree which represents a principal part of the previous search forest in 
Fig. 2 (b). Here, a dot V) is introduced as a virtual root of the tree at the 

beginning. Then every time “q,” is encountered in the sequence, we expand a 
downward edge whose end point is labeled by q. If and only if “q,” is followed by 
“clique,”, the set of all the vertices along the path from the root to the vertex 
q excluding the root ( • ) represents a maximal clique. Every time “back,” is 
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procedure CLIQUES (G) 

/* Graph G = (U, E) */ 

begin 

0' :/* Q := <j> */ 

/* global variable Q is to constitute a clique */ 

1 : EXPAND(U,U) 

procedure EXPAND (S'UBG, CAND) 

begin 

2 : if S'f/BG = (j> 

3 : then print {“clique,”) 

/* to represent that Q is a maximal clique */ 

4 : else u := a vertex u in SUBG which maximizes | CAND n E{u) \ 

/* let EXT^, = CAND - E{u) */ 

5 : while CAND - E{u) 7 ^ (p 

6 : do q := a vertex in {CAND — E{u)) 

7 : print {q, 

/* to represent the next statement */ 

7'-/* Q:=QU{q}-, */ 

8 : SUBGg ■- SUBG n E{q)- 

9 : CANDq := CAND n E{q)- 

10 : EXPAND(S'UBG 5 ,GAArD 5 ); 

11: CAND-- CAND -{q} /* FINI ■- EINI U {q} */ 

12 : print {“back,”) 

/* to represent the next statement */ 

12 :/* Q := Q — {g} */ 

od 

fi 

end of EXPAND 
end of CLIQUES 

Fig. 1 . Algorithm CLIQUES 



encountered in the sequence, we go up the tree backward by one edge to find 
other maximal cliques. It is clear that this transformation can be done in time 
proportional to the length of the resulting sequence. 

4 The Worst-Case Time Complexity 

Given G = {V, E) with V (f>, we evaluate the worst-case running time of the 
previous algorithm CLIQUES(G) with the primed statements O', 7', and 12' 
having been deleted. So, this is equivalent to evaluating that of EXPAND(U, V). 
Now we begin by giving a few definitions. 

[1] Let T{n, m) be an upper bound on the worst-case running time of EXPAND 
{SUBG, CAND) when | SUBG \= n and | CAND \= m {n > m > 1). 

[2] Let Tfc (n, m) be an upper bound on the worst-case running time of EXPAND 
{SUBG, CAND) when | SUBG \= n, \ CAND |= to, and | EXTu \ = 

I CAND — r{u) 1= k at the first entrance to statement 5. 
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(a) An input graph G 



4, 6, 7, 8, clique, back, back, 
5, clique, back, back, 

3, 8, clique, back, back, back, 

1. 2. 9, clique, back, back, back, 

2. 3. 9, clique, back, back, back, 
9, back, 

(c) A resulting printed sequence 




[3] Let us consider a nonrecursive procedure EXPANDq (S'L'BG, CAND) which 
is obtained from EXPAND (S'LBG, CAND) by replacing a recursive call 10: 
EXPAND {SUBGq, CANDq) by 10':EXPAND(i?;., 0). The running time of 
EXPANDg (S'tZBG, CAND) when | SUBC \= n and | CAND |= m can be 
made to be 0{v?) (in which, selection of u at statement 4 can be done in 
0(n^)), and so we assume that it is bounded above by the following quadratic 
formula 



P{n) = Pin^, where pi > 0 . 



□ 
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From the above definitions, we have that 

T(n,m) = Max {Tk{n,m)} . (1) 

l<fc<m 

The following Lemma is a key for evaluating T{n, m) . 

Lemma 1. Consider F,XPAND{SUBG,CAND) when \ SUBG |= n, 

I GAND 1= m, I EXTu |=| CAND-r{u) |= fc yf 0, and \ GANDnr{u) |= m- 
k at the first entrance to statement 5. In what follows, GAND stands exclusively 
for this initial value, though it is decreased one by one at statement 11 in the 
while loop. Let GAND — B{u) = {vi,V2, • ■ • , Vk} and the vertex at statement 6 
be chosen in this order. Let 



SUBGy, = SUBG n r{vi), 

CANDy^ = GAND n rfuf), and 
GANDi = {GAND - {m, U2, . . . , n,_i}) n r{v,). 

Then the following hold. 

(i) Tfed SUBG I, I GAND |) < ^(1 SUBGy, |, | GAND, |) + P{n). 

(ii) a) I GANDi |<| CANDy, \<m-k. 
b) I SUBGy, j<n — fc<n— 1. 

Proof, (i) It is obvious from procedure F,XPAND{SUBG,GAND) and the 
definition of P{n). 

(ii) a) I GANDi |<| CANDy, | = | CANDnP{vi) |<| GANDnr{u) \=m-k. 
b) I SUBGy, 1 = 1 SUBGnr{v,) I 

=1 FiNinr{v,) I -h I GANDnr{v,) |, 

with I FIN I n P{vi) |<| FIN I \= n — m and | GAND n P{vi) |< m—k. 
Here, (n — m) -I- (m — fc) = n — A: < n — 1, since fc > 1. Q.E.D. 

Theorem 1. For the upper bound T{n,m) of the worst-case running time of 
FXPNND{SUBG,GAND) with \ SUBG \= n and \ GAND |= m, the following 
holds for all n > m> 1 : 

T{n, to) < - Q{n) = R{n), (2) 



where 



with 



Q{n) = qin^ + q2n-Iqz, 



qi = pi/2 >0, 92 = 9pi/2 >0, 93 = 27pi/2 > 0, 



and 

C = Max{Ci,C2,C3}, 

with Gi = 392 / In 3 = 27pi/21n3, C 2 = 39pi/(2 • 3^/^), and C3 being the maxi- 
mum over n o/3(l — 2 • 3“^/^)“^ • Q{n — 3)/3"/^. (Note that Q(n — 3)/3”/^ is 
bounded above, since it approaches 0 as n tends to infinity. Hence, C 3 is well- 
defined.) 

Here, R{n) = (73"/^ — Q{n) is monotone increasing with R{n) > 0 for all 
integers n > 0 . 
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Proof. First, by differenciating a continuous function R(x) with x being a real 
number and C > C\, we can prove that R{n) is monotone increasing for all 
integers n > 0. Furthermore, i?(l) = — Q(l) > — 37pi/2 = pi, 

since C > C 2 = 39pi/(2 • 3^/^). So, i?(n) > Pi for all integers n > 1. 

Now we prove that Eq.(2) holds by induction on n. 

Basis. n(= m) = 1. We have T(l, 1) = P(l) = pi by the definition of P{n). 
So, Eq.(2) holds for n = m = 1, since R{1) > pi. 

Induction step. We assume that Eq.(2) holds for all integers n,m, 1 < m < 
n < N, and prove that it also holds for 1 < m < n = + 1. 

Consider EXPAND(S'L®G,CAfVi:)) when | SUBG \= n = fV+1, | CAND | = 
m (1 < m < n = IV + 1), I EXTu |=| CAND - r{u) |= fc yf 0 with CAND - 
r{u) = {vi,V 2 , ■ . ■ ,Vk} at the first entrance to statement 5. Then just as in 
Lemma 1 (i), we have 

Tfe(n,m) = Tfed SUBC |, | CAND |) 

k 

<Y,T{\ SUBC,, I, I CAND, I) + P(n), 

i=l 

where | SUBC,, \< n — 1 = N hy Lemma 1 (ii)-b). Then by the induction 
hypothesis we have 

k k 

Y,T{\ SUBC,, I, I CAND, |) < ^i?(| SUBC,, |). 

i=l 

Since R{n) is monotone increasing and | SUEGy. \< n — k, we have 

k 

R{\ SUBC,, I) < kR{n - k). 

Combining these inequalities followed by some calculations yields 

Tk{n, m) < G3"/3 - {3Q(n - 3) - P{n)} 

= G3”/^ — Q{n) ( from the definition of Q{n)). 

Substituting this inequality into Eq.(l), we have 

T(n,m) < G3"/3-Q(n). 

Thus, Eq.(2) also holds for n = N+ l>m> 1. Therefore, Eq.(2) has been 
induced to hold for all integers n > m > 1. Hence the result. Q.E.D. 

In particular, 

T(n,n) < G3"/3 -g(n). 

Therefore, we conclude that the worst-case running time of the algorithm 
CLIQUES(G) is 0(3"/^) for an n-vertex graph G = (E, E). 

Note here that if we output a list of all the individual maximal cliques, it 
takes G(n3”/^) time in the worst case. 
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5 Concluding Remarks 

We have given an algorithm for generating all maximal cliques in a graph of n- 
vertices and have proved that its worst-case time complexity is 0(3”/^) that is 
optimal as a function of n. It is regarded to be very hard to analyze theoretically 
the time complexity of our algorithm as a function of the number of maximal 
cliques in the graph. 

The algorithm CLIQUES is very simple and is easy to be implemented. It 
is demonstrated by our computational experiments that CLIQUES runs much 
faster than the sophisticated algorithm CLIQUE [6] for several random graphs 
and Moon Moser graphs. In particular, CLIQUES is very much faster than 
CLIQUE for non-sparse graphs in which the number of maximal cliques is very 
large. In our experiments, for example, our algorithm CLIQUES runs more than 
100 times faster than CLIQUE for random graphs of 180 vertices with edge 
density = 0.5, and for a Moon Moser graph of 45 vertices. 

Our present depth-first search algorithm together with some heuristics yields 
an efficient algorithm for finding a maximum clique [12] that is experimentally 
demonstrated to outperform in general other existing such algorithms. It has 
been successfully applied to interesting problems in bioinformatics [l]-[3], the 
design of DNA and RNA sequences for bio-molecular computation [8] , and oth- 
ers. 

Note here that the algorithm in [12] for finding a maximum clique does not 
select a vertex with the maximum degree in each recursive call, which is in con- 
trast to statement 4 in Fig. 1 of the present algorithm CLIQUES for generating 
all maximal cliques. This is because the algorithm[12] with a simple bounding 
rule by heuristics for finding only one maximum clique in each recuresive call 
takes less time than algorithms which are based on such a strategy as above for 
CLIQUES. In our present problem, however, we can not use such a bounding rule 
for finding only one maximum clique. In addition, theoretical time-complexity 
of an algorithm with heuristics is very hard to analyze. This is the reason why 
we employ here the strategies as in Bron and Kerbosch’s algorithm. 
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Abstract. In this paper we introduce a notion of a regular expression 
over infinite alphabets and show that a language is dehnable by an infinite 
alphabet regular expression if and only if it is acceptable by finite-state 
unifieation based automaton - a model of computation that is tightly 
related to other models of automata over infinite alphabets. 



1 Introduction 

A new model of a finite-state automata dealing with infinite alphabets, called 
finite-state datalog automata (FSDA) was introduced in [7]. FSDA were intended 
for the abstract study of relational languages. Since the character of relational 
languages requires the use of infinite alphabets of names of variables, in addition 
to a finite set of states, FSDA are equipped with a finite set of “registers” capable 
of retaining a variable name (out of an infinite set of names). The equality test, 
which is performed in ordinary finite-state automata (FA) was replaced with 
unification, which is a crucial element of relational languages. 

Later, FSDA were extended in [3] to a more general model dealing with 
infinite alphabets, called finite-memory automata (FMA). FMA were designed 
to accept the counterpart of regular languages over infinite alphabets. Similarly 
to FSDA, FMA are equipped with a finite set of registers which are either empty 
or contain a symbol from the infinite alphabet, but contrary to FSDA, registers 
in FMA cannot contain values currently stored in other registers. By restricting 
the power of the automaton to copying a symbol to a register and comparing the 
content of a register with an input symbol only, without the ability to perform 
any functions, the automaton is only able to “remember” a finite set of input 
symbols. Thus, the languages accepted by FMA posses many of the properties 
of regular languages. 

Whereas decision of the emptiness and containment for FMA- (and, conse- 
quently, for FSDA-) languages is relatively simple, the problem of inclusion for 
FMA-languages is undecidable, see [6]. 

An extension of FSDA to a general infinite alphabet called finite-state unifica- 
tion based automata, (FSUBA) was proposed in [8]. These automata are similar 
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in many ways to FMA, but are a bit weaker, because a register of FSUBA may 
contain values currently stored in other registers. It was shown in [8] that FSDA 
can be simulated by FSUBA and that the problem of inclusion for FSUBA lan- 
guages is decidable. 

While the study of finite automata over infinite alphabets started as purely 
theoretical, since the appearance of [3] and [4] it seems to have turned to more 
practically oriented. The key idea for the applicability is finding practical inter- 
pretations to the infinite alphabet and to the languages over it. 

— In [6] , members of (the infinite) alphabet E are interpreted as records of com- 
munication actions, “send” and “receive” of messages during inter-process- 
communication, words in a language L over this alphabet are MSCs, message 
sequence charts, capturing behaviors of the communication network. 

— In [I], members of E are interpreted as URLs’ addresses of internet sites, a 
word in L is interpreted as a “navigation path” in the internet, the result of 
some finite sequence of clicks. 

— In [2] there is another internet-oriented interpretation of E, namely, XML 
mark-ups of pages in a site. 

In this paper we introduce a notion of a regular expression for languages over 
infinite alphabets and show that a language is definable by an infinite alphabet 
regular expression if and only if it is acceptable by an FSUBA. 

The paper is organized as follows. In the next section we recall the definition 
of FSUBA and in Section 3 we present the main result of our paper - unification 
based regular expressions for languages over infinite alphabets, whose equivalence 
to FSUBA is proven in the Section 4. 

2 Finite-State Unification Based Automata 

Till the end of this paper E is an infinite alphabet not containing ^ that is 
reserved to denote an empty register. For a word w = w\W 2 ■ ■ ■ Wr over A U {#}, 
we define the content of w, denoted [tu], by [tu] = {wi yf # : i = 1,2, ... ,r}. 
That is, [m] consists of all symbols of E which appear in w. 

Definition 1. A finite-state unification based automaton (over E) or, shortly, 
FSUBA, is a system A = {E, Q, qg, F, u, 0, pi), where 

~ Q; 9o G Q, F C Q are a finite set of states, the initial state, and the set 
of final states, respectively. 

u = u\U 2 • • - Ur G (A U {#})”, r > 1, is the initial assignment - register 
initialization: the symbol in the ith register is Ui. Recall that ff is reserved to 
denote an empty register. That is, if Ui = ff, then the ith register is empty. 

— 0 C [m] is the “read only” alphabet whose symbols cannot be copied into 
empty registers^ . One may think of 0 as a set of the language constants 
which cannot be unified, cf [7]. 

^ Of course, we could let 0 be any subset of A. However, since the elements of 0 
cannot be copied into empty registers, the automaton can make a move with the 
input from 0 only if the symbol already appears in one of the automaton registers, 
i.e., belongs to the initial assignment. 




Regular Expressions for Languages over Infinite Alphabets 173 



— fj, C Q X {1,2, ... ,r} X " X Q is the transition relation whose el- 

ements are called transitions. The intuitive meaning of fi is as follows. If 
the automaton is in state q reading the symbol a and there is a transition 
{q, m, S, q') G /x such that the mth register either contains a or is empty, 
then the automaton can enter state q' , write a in the mth register (if it is 
empty), and erase the content of the registers whose indices belong to S. The 
mth register will be referred to as the transition register. 

Like in the case of FSDA, an actual state of A is a state of Q together with the 
contents of all its registers. That is, A has infinitely many states which are pairs 
(q,w), where q G Q and m is a word of length r - the content of the registers of 
A. These pairs are called configurations of A. The set of all configurations of A is 
denoted The pair {qg, u), denoted q(,, is called the initial configuration^, and 
the configurations with the first component in F are called final configurations. 
The set of final configurations is denoted F'^. 

Transition relation p. induces the following relation on Q'^ x S x Q'^ . 

Let q,q' G Q, w = w\W 2 - • • Wr and w' = w'^w^ - • • w(. Then the triple 
{{q, w), a, {q' , w')) belongs to if and only if there is a transition {q, m, S, q') G p 
such that the following conditions are satisfied. 

— Either Wm = # (i-e., the transition register is empty in which case a is copied 
into it) and a ^ 0, or Wm = o' (i.e., the transition register contains a). 

— If m ^ S', then = a, i.e., if the transition register is not reset in the 
transition, its content is a. 

— For all j G S, w) = #. 

— For all j ^ S U {m}, w) = Wj. 

Let cr = (Ti (72 ••• cr„ be a word over S. A run of A on cr consists of a sequence 
of configurations co,ci,...,Cn such that cq is the initial configuration gg and 
{d-i, o„Ci) G p‘^,i= 1,2,..., n. 

We say that A accepts cr, if there exists a run Cq, ci, . . . , c„ of A on cr such 
that c„ G F^. The set of all words accepted by A is denoted by L{A) and is 
referred to as an FSUBA-language. 

Example 1. ([8]) Let A = (A, {q, s, /}, q, {/}, #, 0, p) be a one-register FSUBA, 
where p consists of the following two transitions: 



- (g, l,0,s) 

- (s, 1,0,/). 

Alternatively, p can be described by the following diagram. 




^ Recall that go and u denote the initial state and the initial assignment, respectively. 
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It can be easily seen that L{A) = {a\a2 & cri = (T2}: an accepting run 
of A on the word aa is (g, #), (s, a), (/, a). 

In contrast, the language L = : ai ^ 0-2} is not an FSUBA 

language^. To prove that, assume to the contrary that for some FSUBA A = 
{S,Q,qo,F,u, 0 ,iJ,), L = L{A). Since S is infinite and [u] is finite, con- 

tains two different symbols CTi and <72 ■ By the definition of L, it contains the 
word (Ji(T 2. Let {qo,u),{qi,Wi),{q2,W2), = Wi,iWi,2 • • • Wi.r, * = 1 , 2 , be 

an accepting run of A on aia2 and let m be the transition register between 
configurations (qi,wi) and (52, '1^2)- Since neither of a\ and (J2 belongs to u 
and (Ti yf (J2, wi,m = # and W2,m = <^2- Then, replacing W2,m with cti in 
{qo, m), (<7i, lUi), ((72, W2) we obtain an accepting run of A on cticti which contra- 
dicts L = L{AY. 

Example 2 . ([ 8 ]) Let A = (A, Q, qo, F, u, 0 , p) be an FSUBA such that # does 
not appear in [u] and for all {q,m, S,q') £ p, S = %. Then L{A) is a regular 
language over [i 6 ] . In general, since the restriction of a set of configurations to a 
finite alphabet is finite, the restrictions of FSUBA-languages to finite alphabets 
are regular, cf. [ 4 , Proposition 1 ]. 

We conclude this section with the observation that FSUBA languages are 
closed under union, intersection, concatenation, and the Kleene star®. The proof 
of the closure under union and intersection® is based on an alternative equivalent 
model of computation called FSUBA with multiple assignment that is similar to 
M-automata introduced in [ 4 ] , and for the proof of the closure under concatena- 
tion and Kleene star we show that FSUBA with e-transitions can be simulated 
by ordinary FSUBA. 



3 Regular Expressions for FSUBA Languages 

In this section we introduce an alternative description of FSUBA languages 
by the so called unification based expressions which are the infinite alphabet 
counterpart of the ordinary regular expressions. 

Definition 2 . Let X = {x \, . . . , Xr} be a set of variables such that A n A = 0 
and let 0 be a finite subset of A. Unification based regular expressions over 
(X, 0 ), or shortly UB-expressions, if{X, 0 ) is understood from the context, are 
defined as follows. 

® It can be readily seen that L is accepted by a finite-memory automaton introduced 
in [3]. 

^ The decision procedure for the inclusion of FSUBA-languages in [8] is based on a 
refined version of this argument. 

® [8] deals with the inclusion problem only and does not address the closure properties 
of FSUBA languages at all. 

® It follows from Example 1 that FSUBA languages are not closed under complemen- 
tation. 




Regular Expressions for Languages over Infinite Alphabets 175 



— 0,e and each element of X LI O are UB- expressions. 

— If ai and a 2 are UB- expressions, then so is {a\ + 02 ). 

— If X' C X and ai and a 2 are UB- expressions, then so is {a\ -x' 0 : 2 )- 

— If ai is a UB- expression, then so is (ai)*. 

Remark 1. A unification based regular expressions a over (X, 0) can be thought 
of as an ordinary regular expression over (finite) alphabet XL0L{-x’ ■ AT' C A}, 
that we shall also denote by a. That is, the ordinary regular expression a is ob- 
tained from the corresponding UB-expression by replacing its each subexpression 
of the form ai -x' 0:2 with a\ • -x' • ct 2 - We leave the formal definition to the 
reader. 

Next we define the language generated by a UB-expression. Let a be a UB- 
expression over {X, 0) and let w = W1W2 • ■ ■ Wn G Lxu 0 u{ x'-X'cx}{c() With 
the ith symbol Wi of w, i = 1, 2, . . . , n, we associate a word wi G EL {e} in the 
following manner. 

— If Wi G 0, then wi = Wi- 

— If Wi is of the form -x', where X' C X, then wi = e. 

— If Wi G X, then wi can be any element of E \ 0 such that the following 
condition is satisfied. 

Let Wi = x G X and let j be the minimal integer greater than i such that 
Wj = X. If there is no k, i < k < j, such that Wk is of the form -x' and 
x G X' ^ , then wi = wJ. 

The word w = W\W 2 • • • wH, where wi is as defined above, i = 1,2, ... ,n, is 
called an instance of w. We denote by L{a) the set of all instances of all elements 
of Lxu 0 u{ x'.x'cx}(o:)- 

Example 3. Consider a UB-expression a = x -0 {y -{j,} e)* -g x over {{x, y}, 0). It 
can be easily seen that L{a) consists of all words over E having the same first 
and last symbols. 

Similarly, for a UB-expression x -0 x over ({x}, 0), L{x -0 x) is the language 
from Example 1. 

Theorem 1. A language is defined by a UB-expression if and only if it is ac- 
cepted by an FSUBA. 

The proof of the “only if” part of the theorem is based on the closure prop- 
erties of FSUBA-languages and is quite standard. The proof of the “if” part of 
Theorem 1 is presented in the next section. 

We conclude this section with one more closure property of FSUBA-languages 
that is an immediate corollary to Theorem 1. 

Of course, e is redundant (but, still, very useful), because L(0*) = {e}. 

® Here and hereafter, Lxueu{.xr.x>cx}{o) denotes the language over X L 0L {-x' ■ 
X' C X} defined by a, see Remark 1. 

® That is, no subscript X' that appears between the ith and the jth positions of w 
contains x. 
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Corollary 1. FSUBA languages are closed under reversing^^ . 

Proof. It can be easily verified that, for a UB-expression a, (L{a))^ = L{a^), 
where is defined by the following induction. 

— If a G {0, e} U X U 6>, then is a. 

— (oi + is (of + off). 

— (ai -jf/ is (af -jf/ of). 

— ((«)*)« is 

4 Proof of the “if” Part of Theorem 1 

The proof of the “if part” of Theorem I is based on a tight relationship between 
FSUBA and ordinary FA. We shall use the following model of FA. 

Definition 3. ([5]) A (non- deterministic) finite state automaton over a finite 
alphabet S' is a system M = {Q,qQ,F, A), where 

— Q is a finite set of states. 

— qo € Q is the initial state. 

— F C Q is the set of final states. 

— A is a finite subset of Q x S'* x Q called the transition relation. 

A word w over alphabet S' is accepted by Af, if there is a partition w = 
Ui ■ ■ - Un of w, Ui G S'* , and a sequence of states Sq, Si, . . . , such that 

“ So = go, 

— Sn G F, and 

— and for each f = 0, 1, . . . , n — 1, (sj, Ui+i, s^+i) G A. 

Let A = {S,Q,qQ,F,u,0,p.) be an FSUBA with r registers. By [8, Claim 
2], we may assume that [tt] = 0. Also, changing the order of registers, if nec- 
essary, we may assume that u = 9i ■ • ■ Omff", where n -I- to = r. Consider a 
finite state automaton = (Q, go, F, L\) over {x \, . . . , x„} U 6>U {-x'S ■ X' C 
{a;i, . . . , x„}}, where transition relation A is defined as follows. For every tran- 
sition (g, k, S', q') G n, A contains: 

— (g,6*fe-jf/e,g'), if 1 < fc < to, or 

— < k <r, 

where X' = {xi : i m G S}. 

Remark 2. Note that the diagrams of A and AT^ differ only in the transition 
labels which can be recovered from each other in a straightforward manner. 



It should be pointed out that FMA languages are not closed under reversing, see [4, 
Example 8]. The proof of the corollary shows that this constitutes rather serious 
obstacle to find a kind of regnlar expressions for FMA languages. 
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The proof of the “if” part of Theorem 1 is based on the fact that set of the 
sequences of labels of the accepting paths is regular. Namely, the “if” part 
of Theorem 1 immediately follows from Theorem 2 below. 

Theorem 2. Let A and be as above and let a be an ordinary regular 

expression that defines Then, L{a) = L{A). 

Proof. (Sketch) Since T{ei.....e,„,a;i.....a;„}u{x'e:X'c{a;i....,a;„}}(a) = L{M^), it 
suffices to show that L{A) consists of all instances of the elements of L(M^). 

Let p = ei, . . . , Cn be a path of edges in the graph representation of A. One 
can think of p as the diagram of an FSUBA, also denoted by p. Then L{p) 
consists of all words of length n over E which “drive A through p from its first 
to the last vertex (state).” 

Let P denote the set of all paths p starting from the initial state and ending 
at a final state. Then 

L{A) = U L{p). 

pGp 

On the other hand, by Remark 2, p has the corresponding path in M^, also 
denoted by p, that differs from it only in the transition labels. 

The labels of p in the graph representation of form a UB-expression 
that we shall denote by Wj,. Therefore, 

L{M^) = {w^:pG P}. 

Consequently the equality of L{A) to the set of all instances of the elements of 
L(M^) will follow if we prove that for each path p 

L{w^) = L{p), (1) 

i.e., L{p) consists of all instances of Wj,. 

The proof of (1) is by induction on the length n of p and is omitted. 
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Abstract. One of important questions on quantum computing is wh- 
ether there is a compntational gap between the models that are allowed to 
use quantum effects and the models that are not. Several types of quan- 
tum computation models have been proposed, inclnding quantum finite 
automata and quantum pushdown automata (with quantum pushdown 
stack). It has been shown that some quantum computation models are 
more powerful than classical counterparts and some are not since quan- 
tum computation models are required to obey some restrictions such 
as reversible state transitions. In this paper, we investigate the power of 
quantum pushdown automata whose stack is assumed to be implemented 
as a classical device, and show that they are strictly more powerful than 
classical counterparts in the one-sided error setting. That is, we show that 
there is a non-context-free language which quantum pushdown automata 
with classical stack operations can recognize with one-sided error. 



1 Introduction 

One of important questions on quantum computing is whether there is a com- 
putational gap between the models that are allowed to use quantum effects and 
the models that are not. Several types of quantum computation models have 
been proposed, including quantum finite automata and quantum pushdown au- 
tomata [1-4]. In [2], it is shown that quantum pushdown automata can recognize 
every regular language and some non-context-free languages. However, it is still 
open whether quantum pushdown automata are more powerful than probabilis- 
tic pushdown automata or not. The quantum pushdown automaton model in- 
troduced in [2] has a quantum tape head and a quantum stack, and needs 0(n) 
qubits for realization, where n is the execution time. 

In [4] , we introduced another model of quantum pushdown automata, called 
quantum pushdown automata with classical stack operations (QCPDA), whose 
stack is assumed to be implemented as a classical device. This means that a 
QCPDA needs [log m] qubits for specifying the position of the head, where m 
is an input length, in addition to the constant number of qubits for represent- 
ing the finite state control. And we showed that QCPDA’s can simulate any 
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(even non-reversible) probabilistic pushdown automata with the same accep- 
tance probability as that of the original probabilistic pushdown automata in 
[4]. This implies that QCPDA’s are at least as powerful as probabilistic push- 
down automata. However, it remains open whether QCPDA’s are strictly more 
powerful than classical counterparts, i.e., whether QCPDA’s can recognize some 
languages which classical counterparts cannot recognize. 

In this paper, we show that QCPDA’s are strictly more powerful than classi- 
cal counterparts in the one-sided error setting. That is, we show that there is a 
language which a QCPDA can recognize with one-sided error. We also show that 
this language cannot be recognized by non-deterministic pushdown automata, 
that is, it is not a context-free language. This implies that it cannot be recognized 
by probabilistic pushdown automata with one-sided error, either. 

This paper is organized as follows: In Sect. 2, we define quantum pushdown 
automata with classical stack operations and probabilistic pushdown automata. 
In Sect. 3, we show that quantum pushdown automata with classical stack op- 
erations can recognize a certain language L\ with one-sided error. We also show 
that Ti is not a context-free language. Sect. 4 concludes this paper. 

2 Preliminaries 

In this section, we give a description of quantum pushdown automata with clas- 
sical stack operations (QCPDA) defined in [4]. A QCPDA has an input tape to 
which a quantum head is attached, and has a classical stack to which a classi- 
cal stack top pointer is attached. A QCPDA has a quantum state control. The 
quantum state control reads the stack top symbol pointed by the classical stack 
top pointer and reads the input symbol pointed by the quantum head. A stack 
operation is determined solely by the result of an observation of the quantum 
state control. We define QCPDA’s formally as follows. 

Definition 1. A Quantum Pushdown Automaton with Classical Stack Opera- 
tions ( QCPDA ) is defined as the following 8-tuple: 

M (Q, A, Pj (^0? Q acc: Q uej ^ ’ 

where Q is a set of states, S is a set of input symbols including the left and 
the right end-marker {i(:, $} respectively, P is a set of stack symbols including 
the bottom symbol Z , 6 is a quantum state transition function (S : {Q x E x 
P X Q X {0,1}) — > C}, go is an initial state, a is a function by which stack 
operations are determined (a : Q\{Q accOQ j.gj) — > P~^ U {— , pop} ) , Qacc (C Q) 
is a set of accepting states, and Qj-gj (C Q) is a set of rejecting states, where 
Qacc C Qfgj = 0 . □ 

6{q,a,b,q' ,D) = a means that the amplitude of the transition from q to q' 
moving the quantum head to D {D = 1 means ‘right’ and D = 0 means ‘stay’) 
is a when reading an input symbol a and a stack symbol b. A configuration of 
the quantum portion of a QCPDA is a pair (q, k), where k is the position of the 
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quantum head and g is in Q. It is obvious that the number of configurations of 
the quantum portion is n\Q\, where n is an input length. 

A superposition of configurations of the quantum portion of a QCPDA is any 
element of 12{Q x of unit length, where = {0, 1, . . . , n — 1}. For each 
configuration, we define a column vector jg, k) as follows: 

— \q,k) is an n\Q\ x 1 column vector. 

— The row corresponding to (g, k) is 1, and the other rows are 0. 

For an input word x (i.e., the string on the input tape between ^ and $) and a 
stack symbol a, we define a time evolution operator C/® as follows: 

Uai\Q,k))= S{q,x{k),a,q\D)\q\k + D) , 

q' GQ 

where x{k) is the fc-th input symbol of the input x. If [7® is unitary (for any 
a £ r and for any input word x), that is, = I, where 

is the transpose conjugate of C/®, then we say that the corresponding QCPDA 
is well- formed. A well-formed QCPDA is considered to be valid in terms of the 
quantum theory. We consider only well-formed QCPDA’s. 

We define the notion of “words accepted by a QCPDA” in the following. 
For that purpose, we first describe the way how the quantum portion and the 
classical stack of a QCPDA work. 

Let the initial quantum state and the initial position of the head be qo and 
’0’ respectively. We define as \ipo) = |<ZojO)- We also define as follows: 

= span{\q,k) |cr(g) = w}, 

Ea.cc = span{\q, k) \q G Qacc}, 

"^rej spQTi\^\qj k) \q G 

We define the observable O as O = (BjEj, where j is w £ F+ U {—,pop}, 
‘acc’, or ‘rej’. For notational simplicity, we define the outcome of an observation 
corresponding to Ej to be j. 

The computation of the QCPDA proceeds as follows: 

For an input word x, the quantum portion works as follows: 

(a) C/® is applied to \ipi). Let IV'i-i-i) = where a is the stack top symbol. 

(b) li’i+i) is observed with respect to the observable O = (BjEj. Let \(pj) be the 
projection of \ipi+i) to Ej. Then each outcome j is obtained with probability 
I \(j)j) p. Note that this observation causes \ipi+i) to collapse to jj^yj 
where j is the obtained outcome. Then go to (c). 

The classical stack works as follows: 

(c) Let the outcome of the observation be j. If j is ‘acc’ (‘rej’, resp.) then it 
outputs ‘accept’ (‘reject’, resp.), and the computation halts. If j is then 
the stack is unchanged. If j is ^pop\ then the stack top symbol is popped. 
Otherwise (j is a word in E~^ in this case), the word j is pushed. Then, go 
to (a) and repeat. 
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We call the above (a), (b) and (c) ‘one step’ collectively. 

For a language L, if there exists a constant £ (0 < £ < 1) that does not 
depend on inputs, a QCPDA accepts any word in L with probability greater 
than 1 — £, and it rejects any word which is not in L with certainty, then we say 
that L is recognized by the QCPDA with one-sided error. 

To check well-formedness, the following theorem is shown in [4]. 

Theorem 1. A QCPDA is well-formed if the quantum state transition function 
of M satisfies the following conditions for any a S P: 



( 1 ) yqi,q2,b 



X! S{qi,b,a,q',D) 6 {q 2 ,b,a,q',D) 

q',D 

(2) Vgi, 92 , 61,62 



1 (91 = 92) 
0 (91 ^ 92), 



^ (5(91, 61, a, 9', 0)(5(92, 62, a, 9', 1) = 0, 
9' 



where 5(-,-,-) is the conjugate ofS{-,-,-). □ 

For simplicity, we will handle only a subclass of QCPDA’s, called simplified 
QCPDA’s, such that we can decompose the quantum state transition function 
into two functions: one for changing states and the other for moving the quantum 
head. For a € S and 6 G T, we adopt a linear operator Va,t ■ h{Q) — h{Q) for 
changing states, and a function A : Q — > {0, 1} for moving the quantum head. 
In simplified QCPDA’s, direction of the movement of the head is determined 
solely by the state to which the current state makes a transition. Then the 
transition function 6 we are concerned with is described as follows: 



5(9, a, 6, q',D) 



{q'\Va,t\q) {A{q') = D) 
0 {A{q') ^ D), 



where {q'\ Va,b \q) is the coefficient of I9') in 14.6 \q}- The following theorem is 
also shown in [4] . 

Theorem 2. A simplified QCPDA is well-formed if for any a G S and b G P, 
the linear operator Va,b satisfies the following condition: 



X! ( 9 'l foa .6 | 9 l) (9'! K .6 I92) 



1 (91 = 92) 
0 (91 7 ^ 92) 



□ 



We define probabilistic pushdown automata as follows. 

Definition 2. A Probabilistic Pushdown Automaton (PPA) is defined as the 
following 7-tuple: 

A7 i^Q j 27, F, 5, 9o , Q acc^ Q , 

where Q is a set of states, S is a set of input symbols including the left and 
the right end-marker {(|;, $} respectively, P is a set of stack symbols including the 
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bottom symbol Z, S is a state transition function (S : {Q x S x F x Q x {0, 1} x 
r~^ U { — ,pop}) — > [0, 1]), qo is an initial state, Qacc{f= Q) is a set of accepting 
states, and Qj.gj{F Q) is a set of rejecting states, where Qacc H Qj-gj =0- □ 

S{q,a,b,q' , D,w) = a means that the probability of the transition from q 
to q' moving the head to D with the stack operation w is a when reading an 
input symbol a and a stack symbol b. Note that the sum of the weights (i.e. the 
probabilities) of outgoing transitions of a state must be 1. Computation halts 
when it enters the accepting or rejecting states. 

In [4], it is shown that QCPDA’s can simulate any PPA’s with the same 
acceptance probability as that of the original PPA’s. 

Theorem 3. A QCPDA can simulate any PPA with the same acceptance prob- 
ability as that of the original PPA. □ 

We give a sketch of the proof. 

Sketch of Proof. We say a PPA to be of post-state-dependent type if the stack 
operation and the direction of the head movement are determined solely by the 
next state rather than by the triple of (current) state, input symbol and stack-top 
symbol, where the next state is the state after transition. 

Let M = (Q, N', A, (5, go, Qacc, Qrej) ^ PPA. It is straightforward to see 
that any PPA can be converted to a PPA of post-state-dependent type. We 
assume that M is of post-state-dependent type without loss of generality. 

We make M into a reversible PPA by adding extra states and stack sym- 
bols, where reversible means that each state has at most one incoming transition 
for each stack symbol. Suppose that a state q has multiple incoming transi- 
tions for a stack symbol a, that is, there are at least two distinct transitions, 
S{qi,xi,a,q,D,w) = pi (pi yf 0) and 6{q2,X2,a,q,D,w) = p2 {p2 0). We 

consider the case of w G T'*'. The other cases are similar. For each such in- 
coming transition t : S{q' ,x,a,q,D,w) = p {p jb 0), we define a new state qt. 
We add a new stack symbol St to T and define 5 as 5 {q' ,x,a,qt,Q,wst) = p 
and 6{qt,x, St,q, D,pop) = 1, where wst is a concatenation of w and St. Then 
we remove the original transition, that is, we define as 5 {q' ,x,a,q,D,w) = 0. 
We illustrate an example of how to add new states in Fig. 1. The revised PPA 
is reversible. Note that the revised PPA is also of post-state-dependent type. 
Since each state has at most one incoming transition for each stack symbol, it 
is straight forward to see that the revised reversible PPA can be converted to a 
QCPDA, that is, the corresponding time evolution operator {7® can be unitary 
by defining thus far undefined transitions properly. □ 



3 On the Power of QCPDA’s with One-Sided Error 

In this section, we show that QCPDA’s can recognize the following language Li 
with one-sided error while probabilistic pushdown automata cannot with one- 
sided error. 
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Fig. 1. How to add new states. 
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Theorem 4. QCPDA ’s can recognize L\ with one-sided error. 

Proof. We show a QCPDA M that recognizes Li. The outline of M is as follows: 
M rejects any input which is not of the form uffv\\w\>x%y = {a,b}*ff{a,b}*\\ 
{a,b}*\>{a,b}*%{a,b}* . M consists of the three components Mi, M 2 and M3, 
each of which is a deterministic automaton. Mi processes the substring uffv\\wbx 
as follows: 

1. Ml reads u taking 2\u\ steps with the stack unchanged, 

2. pushes each symbol of v one by one, 

3. and reads w and x taking |w| + |a;| steps with the stack unchanged. 

M 2 processes the substring uffv\]w\>x as follows: 

1. M 2 reads u and v taking |m| + |ti| steps with the stack unchanged, 

2 . pushes each symbol of w one by one, 

3. and reads x taking 2\x\ steps with the stack unchanged. 

M3 processes the substring y checking whether the string on the stack is y^ or 
not. 

Ml and M 2 process the substring uffv'^w\)x% in parallel using superposition. 
Firstly, we consider the case that (|m| = |w| = |w| = |a;| and v = w) or (|u| = \x\ 
and |u| = |w| =0). In this case. Mi and M 2 push the same symbol at the same 
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Fig. 2. QCPDA M. 



time, and also read at the same time. Thus, when reading Mi and M 2 
interfere with each other and move to the rejecting state with certainty. (See 
Fig. 2). 

Secondly, we consider the case that ^(|u| = |f| = |w| = |a;| and v = w) and 
^(|u| = |a;| and |t;| = |w| = 0). In this case, the stack operations are different 
between Mi and M 2 at some time. Then the observation, the result of which 
corresponds to the stack operation, causes the superposition to collapse to either 
Ml or M 2 . Thus M moves to the third component M 3 with probability 1/2 with 
w or w on the stack. Thus \i v = or w = y^, M accepts the input with 
probability 1/4. 

We define M in detail in the following. 

M has the following states: 

” <71,11,1,-5 <71,«,0,-5 <71,#,0,-5 <71,15,1,-5 <71,15,0,05 <71,15,0,65 <71,U5,1,-5 <7l,a5,l,-- 
“ <72,11,1,-5 <?2,J5,1,-5 <72,-155,1,- 5 <72,115,0,05 <72,-115,0,65 <72,05,1,-5 92,05,0,-5 <72,%,0,-’ 

“ < 73 , 1 /, 1,-5 <73,y,0,pop- 

<705 < 7 occ 5 Qrejlj Qrej2j QrejSj QrejAj Qrejbi QrejQi Qrej7 1 QrejSj Qrej9i QrejlOi Qrejll- 

Qacc is an accepting state, qreji, ■ ■ ■ , qrejii are rejecting states. The index of qi^z,d,c 
means that the state is used to process a substring z in Mj, A{qi^z,d,c) = d, and 
c represents the value of (j{qi,z,d,c) as follows: the stack top symbol is popped 
(c = ‘pop’), the stack is unchanged (c = — ), and c is pushed (otherwise). The 
initial state is go- 

The state transition diagrams of the components Mi , M 2 and M 3 are illus- 
trated in Fig. 3. It is straightforward to see that the corresponding time evolution 
operators can be extended to be unitary by determining thus far undefined tran- 
sitions properly by Theorem 2. □ 
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M3 
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Fig. 3 . Ml, M2 and M3. Each edge label represents a pair of an input symbol and a 
stack top symbol, where is a wild card. 



We show that a non-deterministic pushdown automaton cannot recognize 
Li, that is, Li is not a context-free language. It is straight forward to see that 
this means that a probabilistic pushdown automaton cannot recognize L\ with 
one-sided error. 

Theorem 5. A non-deterministic pushdown automaton cannot recognize Li. 

□ 

We use Ogden’s lemma [5] to prove Theorem 5. 

Ogden’s Lemma. For any context-free language L, 3n € IN, such that Vz G L, 
if d positions in z are “distinguished, ” with d > n, then 3u, v, w, x, y such that 
z = uvwxy and 

1. vx contains at least one distinguished position; 

2. if r is the number of distinguished positions in vwx, then r < n; 

3. Vi G IN , uv^'wx^y G L. □ 
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Proof of Theorem 5. Suppose that Li were a context-free language. We consider 
a word z = pffq\\r\>s%t = G L\. Let each symbol in 

q be distinguished. Then z = uvwxy satisfies the conditions of Ogden’s lemma. 
If either v or x contains a separator (#, t], b, %), then z' = uv'^wx^y ^ Li. Note 
that vx contains at least one ‘a’ in q. Thus x must contains ‘a’s in t. (Otherwise 
z' = uv"^wx^y{= p'ffq'\]r'\>s'%t') ^ Li since q' yf and r' yf t'^). We assume 
that V contains c ‘a’s and cd= n\. Then uv'^~^^wx‘^~^^y{= p''ffq"^r''\>s”%t") ^ Li 
since \p"\ = \q"\ = \r"\ = |s"| and q" = r", a contradiction. Thus Li is not a 
context-free language. □ 

By Theorem 3, 4, and 5, we can conclude that one-sided error QCPDA’s are 
strictly more powerful than one-sided error probabilistic pushdown automata. 

4 Conclusion 

We have shown that QCPDA’s can recognize a non-context-free language with 
one-sided error. Since the class of languages recognized by one-sided error PPA’s 
is contained in the class of non-context-free languages, Li cannot be recognized 
by a one-sided error PPA. Thus, QCPDA’s are strictly more powerful than PPA’s 
in the one-sided error setting. Our conjecture is that L\ cannot be recognized by 
probabilistic pushdown automata even with two-sided error, which implies that 
QCPDA’s (with two-sided error) are strictly more powerful than probabilistic 
pushdown automata. 
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Abstract. In this paper we consider the complexity of learning mono- 
tone DNF formulae and Boolean circuits in an exact learning model 
called the teaching assistant model. This model, introduced in one of 
our earlier works, uses teaching assistant classes for classifying learning 
problems. Teaching assistant classes are a learning-theoretic analog of 
complexity classes. We show that monotone DNFs are learnable using a 
teaching assistant in the class SPP. On the other hand. Boolean circuits 
and general DNFs are not learnable using an SPP teaching assistant un- 
less NP C SPP. We also show that learnability of Boolean circuits with 
an assistant in the class NP will improve the Karp-Lipton collapse result 
to P>^^ 



1 Introduction 

In this paper we study the complexity of learning certain concept classes in an 
exact learning model called the teaching assistant model. The teaching assistant 
model, developed in our earlier work [3,4], is a refinement of Angluin’s well- 
known exact learning model. In Angluin’s model a learner learns a target concept 
by making either membership queries or equivalence queries (or both) to an 
honest teacher. In the teaching assistant model, a complexity-theoretic approach 
is taken towards learning. 

The teaching assistant model: A learning model based on complexity 
classes. A central idea in this model is the notion of a teaching assistant] an 
intermediate agent between the learner and the teacher. The learner, instead 
of querying the teacher directly, queries a teaching assistant to learn the target 
concept. The teaching assistant in turn makes membership queries to the teacher 
to gather information about the target concept. The usefulness of this model for 
investigating the complexity of learning problems comes from the fact that we 
can classify teaching assistants into teaching assistant classes, which is similar 
to the way we classify computational problems into complexity classes. 

This complexity-theoretic model for learning is closely related to Angluin’s 
exact learning model. An equivalence query to a teacher can be replaced by a 
sequence of queries to an NP oracle, which in turn makes membership queries to 
the teacher. In particular, an NP-like set {{y, r) j y is a prefix of a counter exam- 
ple to the hypothesis r} can be used to prefix-search for a counter example for an 
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equivalence query r. This NP oracle is an example of a teaching assistant. Hence, 
we can say that classes learnable with equivalence queries in Angluin’s model 
are also learnable in the teaching assistant model with an NP teaching assistant. 
Similarly, a membership query can be seen as a query to a teaching assistant 
in the class P. Therefore, any concept classes learnable with only membership 
queries in Angluin’s model are also learnable with a P teaching assistant. 

The main advantage of restating the learnability using queries in the language 
of NP and P is that we can generalize this idea to other complexity classes. For 
example, consider the complexity class UP. UP is the class of decision problems 
accepted by nondeterministic machines with a restriction that there is at most 
one accepting computation for any instance of the problem. The complexity of 
problems in UP is believed to be strictly between P and NP. Hence we can say 
that the concept classes that are learnable using a teaching assistant in the class 
UP have less learning complexity than concept classes that require assistants 
from the class NP. In a certain well-defined sense, the complexity of teaching 
assistants quantify the complexity of learning. In general, we can define a teach- 
ing assistant class analogous to any complexity class. Hence, introducing this 
intermediary called the teaching assistant in the exact learning model provides 
a framework which is suitable for analyzing the complexity of learning prob- 
lems. We call this framework the teaching assistant model (TA model) for exact 
learning. 

In this paper we are interested in the teaching assistant classes analogous 
to the complexity classes NP and SPP. While NP is a well-known class, some 
motivation needs to be given for studying the class SPP. 

The complexity class SPP and its significance. SPP was introduced and 
studied by Fenner, Fortnow, and Kurtz in [8]. This class was also independently 
studied in [14] under the name XP and in [9] under the name ZUP). SPP 
consists of decision problems that are accepted by nondeterministic polynomial 
time Turing machines such that; for positive instances, the difference in the 
number of accepting and rejecting paths of the machine is exactly one, and for 
negative instances, this difference is zero. SPP can be seen as the gap (difference 
between accepting and rejecting paths) analog of the class UP. 

From a structural point of view, this class is interesting mainly because of 
its lowness property. A complexity class C\ is said to be low for another (rela- 
tivizable) complexity class C 2 if = C 2 . That is, any C' 2 -machine that uses a 
language in Ci as an oracle can be converted into another (72-machine with the 
same acceptance behavior without any oracle access: Ci is powerless as oracle to 
C 2 ■ It is shown in [8] that SPP is low for counting complexity classes such as PP, 
C=P, and ModfcP for k > 2. Thus, intuitively, SPP is a class of “low counting 
complexity” . 

Although SPP is a structurally defined class, it has been proved to be very 
useful in analyzing the complexity of “intermediate problems”- problems that 
are not known to be in P, but unlikely to be NP-complete. Membership of a 
problem in the class SPP can be seen as evidence that the problem is unlikely 
to be NP-complete: no known NP-complete problem has been shown to be in 
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SPP. In [13, 15] it is shown that many of the known candidate problems of 
intermediate complexity from the domain of algebra are in the class SPP. Very 
recently, Arvind and Kurur [2] presented an elegant SPP algorithm for the well- 
known Graph Isomorphism problem. 

The exact power of SPP in relation to NP is not yet clear. Because of the 
strong restrictions on the computation tree of nondeterministic Turing machines 
accepting SPP languages, it seems surprising that NP could be in SPP. Further- 
more, there are oracles separating NP from SPP [5]. On the other hand, results 
from derandomization and resource bounded measure theory give some evidence 
towards the containment of NP in SPP [12, 11]. 

Prior results on teaching assistant model. In [3], the teaching assistant 
framework was introduced primarily to distinguish the complexity of learning 
certain algebraic concepts such as vector spaces and permutation groups from 
the nonalgebraic concept class CNF. In particular, it was shown that vector 
spaces can be learned using teaching assistants in the class SPP. Learning CNFs 
using a teaching assistant in the class SPP is not possible, unless NP C SPP. A 
subsequent work initiated a study of teaching assistant classes analogous to many 
well-studied complexity classes such as P, UP, NP, (UPflco-UP), (NPHco-NP), 
SPP, LWPP, and established their importance in classifying the complexity 
of learning [4]. In particular, [4] gave a fine separation among many of the 
teaching assistant classes mentioned above. For example, this paper presented a 
concept class that is learnable using a (UPflcoUP) teaching assistant, but not 
learnable with teaching assistants in P. Note that, unlike in complexity theory 
where establishing proper separations among complexity classes is extremely 
difficult, we can actually show separations among TA classes. 

New results in this paper. In this paper, we prove new results on the com- 
plexity of learning monotone DNFs and Boolean circuits in the TA model. First, 
we show that monotone DNFs are learnable using teaching assistants in the class 
SPP. On the other hand, general DNFs and Boolean circuits are not learnable 
using teaching assistants in the class SPP unless NP C SPP. We also show that 
if Boolean Circuits are learnable with an NP assistant then we can improve 
the current best upper bound of the Karp-Lipton collapse result; under this as- 
sumption we show that SAT G P/poly PH C P^’’. The current best upper 
bound is S^; a complexity class that contains P”'’. Previously, it was known 
that SAT G P/poly PH C ZPP”'’ (attributed to Watanabe in [6]). More 
recently, Cai showed that S 2 C ZPP”'’ [7]. Cai’s result together with the result 
that SAT G P/poly PH C S 2 (attributed to Sengupta in [7]) improved the 
Karp-Lipton upper bound from ZPP*^^ to S 2 . There are relativized worlds where 
SAT has polynomial size circuits and PH % P'^’’ [10,16]. Therefore, it appears 
that in order to prove learnability of Boolean circuits with NP-assistants, we 
need non-relativizing techniques. 

Because of the space limitations, we only give sketches of the proofs in this 
extended abstract. Some of the proofs are omitted completely. 

In complexity theory, the class SPP has been used successfully in study- 
ing computational problems with intermediate complexity. The results in this 
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paper help to further establish the role of the class SPP in understanding the 
complexity of learning problems. 



2 Notations, Definitions, and Technical Preliminaries 

We follow standard language-theoretic notations in this paper. For sets Xi and 
X2, Xi \ X2 denotes their set difference. Let A denote the empty string. For 
2/1, 7/2 G A7*, 2/1 < 2/2 means that 2/1 is lexicographically smaller than or equal to 
2/2. Let Z denote the set of integers and N the set of natural numbers. 

Complexity theory. We use standard definitions and notations from complex- 
ity theory. In particular, oracle Turing machines play a central role in many 
of the notions in this paper. Please refer to standard text books for these def- 
initions and notations. For a nondeterministic polynomial time (in short, NP) 
machine M, uccm : A 7 * — > N denotes the function defined as: accM{x) = the 
number of accepting paths of M on input x. #P denotes the class of func- 
tions {accM|Af is an NP machine}. gapM '■ X* ^ Z denotes the function de- 
fined as: gapM{x)= (number of accepting paths of M on input x - the num- 
ber of rejecting paths of M on input x). GapP denotes the class of functions 
{gapM\M is an NP machine} [8]. GapP is the closure of #P under subtraction. 
We will make implicit use of the following closure properties of GapP: a) the 
product of a polynomial number of GapP functions is a GapP function, b) the 
sum of a polynomial number of GapP functions is a GapP function (see [8] for 
details) . 

Using the notion of gaps we can define the complexity class SPP as follows. 
A language L is in SPP if there is a function / in GapP such that; f{x) = 1 
if a: G L and f{x) = Q x ^ L. For the purpose of comparison, we give the 
definition of the class UP. A language L is in UP, if there is a function / in 
#P such that; f{x) = I if x € L and f{x) = 0 if x ^ L. Thus, SPP generalizes 
the class UP. The inclusions P C UP C SPP are a direct consequence of the 
definitions. 

Learning theory. We only deal with the learnability of finite concepts. A rep- 
resentation class V is a, tuple (i?, p, p) where p is a polynomial, R C S* and p 
is a collection p = such that Pn '■ R ^ is a many-one mapping. 

Any element in the range of p is called a concept of V. The set of all concepts 
of V is called the concept class represented by V. When there is no confusion, 
we denote the concept class represented by V also by V. For any n, let Vn 
denote the concepts of V in For any concept c & Vn let size{c) denote 

min{|r| : p„(r) = c}. 

Although Angluin’s query model is very closely related to our model, we omit 
the definitions for this model since we do not use it in any essential manner. In- 
stead, we directly give formal definitions related to the teaching assistant model. 
Please refer to Angluin’s original paper for definitions related to this query model 
[1]. We start with the abstract definition of a teaching assistant. 
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Definition 1. Let V be a representation class. Let Q : N x Nx r*x2^‘ ^ {0,1} 
be a predicate. A tuple (n, a:, c) G N x N x x 2^ , is said to be V -valid if c & 
Vn and size{c) < 1. The teaching assistant defined by Q w. r. t. the representation 
class V is the set L{V) = {{n,l,x,c)\{n^l^x^c) is V -valid and Q{n,l,x,c)}. 

Let V = {R,p,,p) be a representation class and L{V) be a teaching assistant 
for V. A learner is a deterministic oracle Turing machine transducer. The learning 
algorithm for V comprises of a pair - a learner a, and a teaching assistant 
L{V). The teacher selects a target concept c G Vn- The learner a, on input 
(0”,0^), has to output a representation r G R such that iXn{r) = c, provided 
that size{c) < 1. a can query the teaching assistant L{V). When a wants to 
query the teaching assistant about {n,l,x,c), it writes x on its oracle tape. 
Notice that other parameters, n, I and c, are fixed prior to learning and are 
implicitly communicated to the teaching assistant. For such a query x by the 
learner, the learner receives the answer ‘Yes’ if (n,l,x,c) G L{V) and ‘No’ if 
(n, I, x, c) ^ L{V). After a finite number of steps, a outputs a representation of 
the target concept c. V is said to be learnable with an assistant L{V) if there is 
a learner a for V with queries to L{V) as teaching assistant. We say a learns V 
with teaching assistant L{V). 

Let 7^ be a representation class. Let a be a learner learning V with a teaching 
assistant L{V). For an input (0",0*), the time complexity of a learner is the 
number of steps it takes before it writes down a representation of the target 
concept. V is said to be deterministic polynomial-time learnable with a teaching 
assistant L{V), if there is a polynomial q and a deterministic learner a learning 
V with L{V) such that for all n, and for all c G Vn, on input (0”, 0^), the running 
time of learner a learning c, is bounded by q{n -\- 1). Adapting the complexity- 
theoretic notation FP for functional polynomial time, we denote polynomial-time 
learnability by FP-learnability. 

In order to quantify the complexity of teaching assistants we can define classes 
of teaching assistants with respect to a given representation class. These classes 
can be defined analogous to some well-studied complexity classes. In this paper 
we are interested in the teaching assistant class analogous to the complexity 
classes SPP and NP. We define the teaching assistant class SPP. The class NP 
can be defined analogously. 

Definition 2. Let V be a representation class and L{V) be a teaching assistant. 
L{V) is said to be in the class SPP(T^) if there exists a polynomial-time non- 
deterministic oracle Turing machine M which for any V -valid tuple {n,l,x,c), 
on input (0",0*,a;), uses c as oracle and produces a gap=l if (n,l,x,c) G L{V) 
and gap=0 if (n,l,x,c) ^ L{V). The behavior of machine M is not specified on 
inputs that are not V -valid. 

We have the following definition of learnability with teaching assistant classes. 

Definition 3. Let C denote a teaching assistant class. We say a representation 
class V is TV -learnable with a C -assistant if there exists a teaching assistant 
L{V) G C(fP) and an TV -learner a, such that a learns V with the teaching 
assistant L{V). 
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The following theorem relates the TA model to Angluin’s model. 

Theorem 1 ([4]). Let V he a representation class. Then V is FT -learnable with 
membership queries if and only if it is FF-learnable with a F-assistant. IfV is 
FF -learnahle with membership and equivalence queries, then it is FF -learnahle 
with an FiF -assistant. 

3 Learning Using SPP Assistants 

In this section, we give a learning algorithm for monotone DNF using a teaching 
assistant in the class SPP. We also note that general DNFs and Boolean circuits 
are not learnahle using SPP unless NP C SPP. Note that, from Theorem 1 and 
the fact that monotone DNFs are learnahle with membership and equivalence 
queries, they are also learnahle with an NP-assistant. 

Let mDNF denotes the concept class of monotone functions represented by 
monotone DNF formulae. Let CIRCUITS denote the representation class of 
boolean circuits. For ease of presentation, we define the size of a concept c 
in mDNF as the number of prime implicants of c. This is only polynomially 
different from the regular notion of size and does not affect our algorithm. 

Theorem 2. The representation class mDNF is FF-learnable with an SPP- 
assistant. 

Proof. (Sketch) We will first present a teaching assistant L(mDNF) and show 
that there exists an oracle machine M with the required gap behavior accepting 
L(mDNF). We then present a learner LEARNER-mDNF which uses L(mDNF) 
to output a monotone formula for the target concept. 

The learning algorithm that we give is an adaptation of Angluin’s algorithm 
for learning monotone DNFs using membership and equivalence queries. Our 
algorithm works as follows. Let c be the concept represented by a monotone for- 
mula (j). As in the case of Angluin’s algorithm for mDNF, the learning algorithm 
will find out each of the prime implicants of <f> one by one. This is done by doing 
a prefix-search with the aid of the teaching assistant. In order to make the TA in 
SPP, the prime implicants are prefix-searched in the order of increasing number 
of variables in them. The teaching assistant that we use for the learner is defined 
as: 

L(mDNF) = {{n,l,4>,k,y,c)\(j) is a disjunction of monotone terms; there are no 
X G (c\(j)) having less than k Is; there exists z G (c\(l>) having exactly k Is; y 
is a prefix of lex- first such z} 

Claim. L(mDNF) G SPP(mDNF) 

Proof (of claim). The nondeterministic oracle machine M that witnesses 
L(mDNF) in SPP(mDNF) is defined below. M uses an auxiliary oracle non- 
deterministic machine N 2 as subroutine, which in turn uses Ni . We define these 
two auxiliary machines first. 
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MACHiNEA^f ((n, I, 4>, i, k, y)) 

1. Guess in lex- first order strings zi, Z 2 , ■ ■ ■ , Zi 

2. Accept if all the following conditions are met 

3. j/ is a prefix of z\ 

4. 2 i, 22 , . . . , 2 i have exactly k Is 

5. zi, Z 2 , ■ ■ ■ , Zi are in c (using membership queries to c) 

6. 21 , 22 , . . . , 2 i are not in <j) 

7. Else Reject 

MachineAj ((n, I, <j>, k, y)) 

1. Produce a gap of o-ccNf({n, I, k,y))U‘j^i+i (l - accNf({n, I, 4>, j, k,y))) 
Machine I, 0, k, y)) 

1. If (() is not a disjunction of monotone terms produce a gap=0 

2. Else produce a gap of gapj^c ((n, I, <j>, k, y)) (^1 - gaPN^ ({n, I, 4>, k' , y))^ 

Let c be a target concept fixed by the teacher with size(c) < 1. Let ko be the 
number of variables in the smallest (in terms of the number of variables) prime 
implicant of c that is not in <j). Let to be the number of such prime implicants 
with /co variables. 

We will argue that M produces a gap=l if ((n, I, <j), k, y, c)) G L(mDNF) and 
produces a gap=0 if ((n, I, k, y, c)) ^ L(mDNF). Since if (j) is not a disjunction 
of monotone terms M produces a gap=0, we will assume otherwise for the rest 
of the proof, (n, I, (j>, k, y, c) G L(mDNF) if and only \i k = k^ and y is a prefix 
of lex-first string z G {c\ (j>) with exactly k ones. We need to show that 
produces a gap=l in this case and produces a gap=0 otherwise. For brevity of 
presentation, we only present the case when y is the prefix of the lex-first string 
2 G {c\4>) with exactly k ones. The other case can be argued similarly. 

We first observe certain acceptance behavior of N 2 which is stated as the 
next claim. The proof follows from the descriptions of Ni and N 2 - 

Claim (subclaim). 

1. If fc < ko, then gapjqc{{n, I, (j), k, y)) = 0. 

2. If fc = ko, then gap^c{{n,l, (j>, k,y)) = I. 

Proof (sketch for subclaim). 

1 fc < fco. In this case Nf will have 0 accepting paths. Hence, whatever be 
the value of i, gapjqc{{n, I, (j>, i, fc, y)) = 0. Hence, the total gap produced by 

= 0 . 

2 fc = fcg. Split the sum in line 1 of the description of Aff into three cases: 

i < io, i = io, and i > io. For i > io, N( will have 0 accepting paths and 

hence that part of the sum contributes 0 to the overall gap. For i = io, 
N( will have exactly 1 accepting path and for this case each of terms in the 
product contributes 1. Thus the contribution to the overall gap in this case is 
1. For the case i < io, consider the term uU+i (1 — accNc[{n, I, (f>,j, fc, y)))- 
Since i < io, there is term in this product with j = io. For this value of j = 
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to, accN^{{n,l, k,y)) = 1 and hence (l — accN^{{n,l,(f>,io,k,y))) = 0. 
Hence the contribution from this case to the overall gap =0. Therefore, the 
total gap=l. 

Case 1: k = ko. In this case we need to show that gapj^c = 1. Consider the 
gap expression in the line 2 of the description of M‘^. From the above claim, 
gapjsr^{{n,l,(j),j,k,y)) = 1 and for all k' < k - 1, gapj^c{{n,l,(j>,j,k',y)) = 0. 
Hence the overall gap produced is 1. 

Case 2: k < ko- We need to show that gapj^c = 0. From the above claim, the 
term gapj^c{{n, I, k, y)) = 0 and hence the overall gap produced is 0. 

Case 3: k > ko- (This case is similar to the case t < zq in the above claim) 
In this case we need to show that gapj^c = 0. Consider the product term 

~ 2/))^ of the gap expression in the line 2 of the 

description of Af^. Since k' runs from 1 to fc — 1, if A: > fco there is a term 
— gapj^c ((n, I, <j), ko, y))^ that appear in this product. Now from the subclaim, 

for this value of k' , — gapj^c {{n, I, (j), ko, y))^ = 0 and the product term, and 

consequently the overall gap, is 0. 

The following algorithm learns mDNF in polynomial time by making only 
0{ln) queries to the teaching assistant L(mDNF). In the algorithm, for a binary 
string y G {0, 1}”, Y denotes the the corresponding monotone term. 

LEARNER-mDNF(0", o') 

1. (j>^F 

2. k^l 

3. while (0”, 4 >, k, X,c) ^ L 

4. k ^ k + 1 

5. if fc = n + 1 

6. then output (f) and stop 

7. y^A 

8. for j = 1 to n do 

9. if (0",^i.,A:,yl,c) e L 

10. then y ^ yl 

11. else y ^ yO 

12 . (j>^(l>\/Y 

13. goto 2 

Finally we note that any representation class for which the equivalence prob- 
lem (checking whether two strings represent the same concept) is co-NP-complete 
cannot be learned using an SPP assistant unless NP C SPP. We state this as our 
next theorem for the case of CIRCUITS and general DNFs. The proof essentially 
follows the line of argument for the proof of a similar result ([4], Theorem 7.4). 

Theorem 3. If the representation classes of CIRCUITS or DNF are learnable 
with an SPP-assistant then NP C SPP. 
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4 Learning with NP Assistants 

In this section we consider learnablity using NP-assistants. Since learning with 
NP-assistants generalizes learning in Angluin’s model with equivalence + mem- 
bership queries, it is one of the most interesting teaching assistant classes. 

Theorem 4. //CIRCUITS are FF-learnable with an NF -assistant then SAT G 
P/poly ^ PH C P'^P 

Proof. (Sketch) We will present the proof sketch here. We will use the self- 
reducibility structure of SAT to prove the theorem. Under this assumption we 
will show how to construct a circuit for SAT using P”^ machine. The construction 
is done inductively. For every n let Cn be the circuit that accepts SAT with n 
variables. 

For the sake of simplicity, we omit the size parameter I in this proof. Let a 
be a learner, which on input 0" learns a polynomial size circuit for SAT with 
n variables with an NP-assistant L(CIRCUITS) = {{n,y,c)\y G S*-,Q(n,y,c)}, 
where Q is the predicate defining the NP-assistant. Let the oracle machine N 
accept L(CIRCUITS). We will focus on a’s computation when the target circuit 
is the circuit for SAT. 

Consider the following machine N' (now without oracles) which on input 
(0”,y, C), simulates N with target concept SAT. Whenever N makes an oracle 
query / to SAT, it constructs two formulae /o and /i where fi for i G {0, 1} 
is the formula / with the variable replaced with i. Then, it checks whether 
C(fo) = 1 or C(fi) = 1. If yes then it continues simulation as if the query / is 
answered “YES”; otherwise it proceeds as if the query / is answered “NO”. Let 
L be the NP language accepted by this machine N' . 

Now consider the following FP machine M that constructs the circuit Cn for 
SAT inductively using the NP language L. When n is a constant, M exhaustively 
searches for the circuit for SAT. Now assume that M has constructed 0„_i. M 
simulates the learner a with SAT as target concept. Whenever a makes a query 
y to the teaching assistant, M makes a query (0”, y, Cn-i) to L and continues 
simulation. It is easy to see that M constructs the circuit for SAT. Therefore 
if CIRCUITS is learnable using an NP-assistant, then if SAT G P/poly, SAT 
has P’^^-uniform circuits. Now, using standard techniques, it can be shown that 
AJ C P”'’, which implies PH C P”'’. 

It is known that for any class C which contains NP, SAT G P/poly PH C C, 
then C has languages with polynomial circuit complexity for any fixed polyno- 
mial. We have the following corollary whose proof is omitted. 

Corollary 1. //CIRCUITS is learnable with an ISF -assistant, then for any k, 
P”’’ has languages that does not have circuits of size . 

Perhaps the most important concept class that is worth investigating in the 
TA model is general DNF formulae. An open problem regarding DNF learning 
in Angluin’s model is whether equivalence -I- membership queries are power- 
ful enough for learning DNF formulae. A natural step towards progress in this 
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problem is to investigate it in a more general learning framework than Angluin’s 
model. The TA model with NP teaching assistants provides exactly such a frame- 
work. Are general DNF formulae learnable using an NP assistant? 
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Abstract. In machine-learning, maximizing the sample margin can re- 
duce the learning generalization-error. Thus samples on which the target 
function has a large margin (7) convey more information so we expect 
fewer such samples. In this paper, we estimate the complexity of a class 
of sets of large-margin samples for a general learning problem over a fi- 
nite domain. We obtain an explicit dependence of this complexity on 7 
and the sample size. 



1 Introduction 

Over the course of the last decade or so the field of learning theory has benefited 
from a rich body of work and the inception of a few key mathematical quanti- 
ties which concisely capture and describe the complexity of accurate learning. 
Among those that characterize the problem of learning pattern classification is 
the Vapnik-Chervonenkis (VC) dimension. Based on the VC-dimension, upper 
bounds on the worst-case deviation of the learner’s error from the optimal error 
may be obtained. The bounds are a function of the sample size m and some 
complexity term which is dependent on the VC-dimension of the function class. 
They are independent of the actual training-sample and hence become useful, 
i.e., have a value between 0 and 1, only for very large sample sizes^. Thus in 
this respect they are weak. More recently ([10, 2, 3, 9]) sample dependent bounds 
have been shown to be stronger. They hold when the learner finds a hypothesis 
which maximizes the margin on a training sample over a class of functions. 

Being that large-margin samples may yield better hypotheses (having lower 
error estimates) then they convey more information about the target. Intuitively, 
we expect that having more information about the target should result in fewer 
possible hypotheses. Since each hypothesis is associated with a particular set of 
training samples based on which the learner may infer it then we expect there 
to be fewer possible samples with increasing amount of information, i.e., with 
an increase in the margin. 

In order to try to formalize these ideas we take a combinatorial approach 
aiming to estimate the cardinality of the class of sets of large-margin samples 

^ This has recently gotten significant attention from the computational learning field, 
see for instance the papers at the NeuroCOLT (2002) Workshop on Generalization 
bounds less than 1/2. 

K.-Y. Chwa and J.I. Munro (Eds.): COCOON 2004, LNCS 3106, pp. 198-209, 2004. 
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(where each set is associated with an hypothesis). Its logarithm is a natural mea- 
sure of the complexity of any such sample. Our main technical result (Theorem 
3) provides an estimate of this quantity. 

We start with some notations and definitions. 

2 Basic Notations and Definitions 

Let X be a domain. For o G R, define sgn(a) = -1-1 if a > 0 and — 1 if a < 0. 
The following definitions can be found for instance in [1] . For a class A of real- 
valued functions on X the Pseudo-dimension, denoted as dimp{A), is defined 
as the maximum cardinality m of a sample S = . . . ,Xi^} C X such that 

there exists a translate vector r = [ri,r2 , . . . ,rm] G R™ where for each vector 
V G { — 1, 1}™ there is a function a„(-) G A that satisfies sgn(a„(a;ij) — rj) = vj 
for 1 < j < TO. The sample S is said to be shattered by A. In the special case 
where A consists of mappings from X — > {—1,1} and = 0, 1 < j < to, then 
dimp(^) is called the Vapnik-Chervonenkis dimension of A and is denoted by 
VC {A). 

For any 7 > 0 the ^-dimension of A, denoted as fat-y(A), is defined as the 
maximum cardinality to of a sample S = {a;^, . . . ,Xi^} C X such that there 
exists r = [ri,r2 , . . . ,rm] G R™ where for each vector v G (—1, 1}™ there is a 
function a«(-) G A that satisfies Ov(xy) > rj -h j if Vj = 1 and av{xi^) < Vj — j 
if Vj = — 1, for 1 < j < TO. The sample S is said to be ^-shattered by A. 

For B > 0 let H denote a class of real- valued functions h from X into [0, S]. 
Let S = X™ consist of all sets S C X with cardinality 151 = to. For any 
S = {xi ^ , • ■ • , Xi^ } define the margin of ft- on 5 to be 

ds{h)= inin \h{xi^)~ B/2\. 

Denote by I(^) the indicator function which takes the value 1 for any x & A 
and zero otherwise. For brevity, we sometimes write I(|ft(x) — B /2\ — c) instead 
of I(a: G {z : \h{z) — B/2\ — c> 0}). Henceforth X is assumed to be finite and 
well-ordered and our result will depend^ on |X|. All logarithms are taken with 
respect to the base 2. 

3 Overview of the Problem of Learning 

The generalized Probably- Approximately-Correct (PAC) model of learning from 
examples [5] can be used to represent learning classification. Under this model, 
data used for training and testing is generated independently and identically 

^ In this respect our analysis differs from standard learning models which admit an 
infinite domain. However, our work here is part of an ongoing research on compar- 
ing the value of different kinds of information-inputs for learning and parallels the 
work of Kolmogorov on algorithmic complexity of information where quantities are 
expressed in terms of the length n of binary sequences that describe some finite 
domain X of size |A| =2". 
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(i.i.d) according to an unknown but fixed probability distribution P which is de- 
fined over the input/output pairing (x,y) € X x {—1, 1} where the two possible 
classification categories are labeled as — 1 and 1 . In general, y need not be a func- 
tion of X however for the purpose of our work here we assume y = sgn(t(a;)) where 
t G Ti. is some unknown target function in a class Ti. of real-valued functions on 
X. To measure the misclassification error by any hinTi. the natural choice is the 
empirical error which is based on a randomly drawn sample Sm = { {xi ^ , yi ^ ) i 
according to P. It is defined as 

^ m 

Lm{h) = — ^I(sgn(/i(a;iJ) yf yi^). 

7Tl . 

The (true) misclassification error of h is L{h) = P{{x,y) : sgn{h{x)) yf y) and is 
unknown to the learner as P is not known. According to this model, the process 
of learning amounts to finding a hypothesis h G Ti. which minimizes the empirical 
error over Ti, i.e., 

Lm{h) = min Lm{h). 
hen 

The minimal true-misclassification error is clearly zero, i.e., L{t) = 0. But in 
general the error of learning, L(h), may be greater than zero. One of the main 
aims of the research in this field is to understand which settings, i.e., different 
learning algorithms and hypothesis classes Ti, yield lower error L{h). Theoretical 
estimates of L{h) exist for various scenarios ([!]), the simplest being the pure 
PAG setting. This is described in the following result which appears as Theorem 
4.1 in [4]. 

Theorem 1. Let Ti he a class of functions from X to { — 1, 1} with VC{H) = d. 
For any probability distribution P on X x { — 1,1}, let S = {{xi^ , yi^)}JLi be 
drawn according to P. Based on S, suppose h G Ti satisfies Lm{h) = 0. Then 
for any (5 > 0, with probability 1 — i5, 

^ 0 X ^P7Y> ^ \ 

L{h) < e(m, d,S) = — ( dlog —— + log ^ ) 

m \ d 0 J 

provided m > maxjc?, 2/e(m, d, d)}. 

Consider now the scenario where the learner obtains a sample S of cardinality 
m on which it is known that the target t has a large margin, i.e., ds(t) > 7. 
Support Vector Machines and other Kernel-based learning methods which use 
the principle of maximing the margin ([10,4]) can be represented in this way. In 
general, the 7-dimension fat.y(7d) of a class Ti decreases with an increase in 7 so 
as the following result suggests, the error of the classifier is significantly reduced 
with a larger margin (this result appears as Corollary 4.14 in [4]). 

Theorem 2. Let Ti consist of real-valued functions from X to [0, 1] and fix a 
7 > 0. Consider doing classification by thresholding functions h at 1/2, i.e., 
sgn{h{x) — 1/2), and denote by Lm{h), L{h) the corresponding empirical and 
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true misclassification errors, respectively. For any probability distribution P on 
X X { — 1, 1}, let S = {{xij , yij)}^i be drawn according to P. Based on S suppose 
that h G H satisfies L^{h) = 0 and ds{h) > 7. Then for any (5 > 0, with 
probability 1 — 

/ 7 2 / 1 8em , 32m , 4\ 

L(h) < e(m. d, a) = - log log — + >«e ; j 

provided m > max{2/e(m, d, (5),/af^/g(7i)}. 

This result is an example of a sample-dependent bound since in reality the 
value of 7 is the margin achieved by h which obviously depends on the data. 
It is apparent that large-margin samples are worth more for learning since the 
bound decreases with increasing 7. 

Our interest now is to estimate the complexity of large-margin samples. In 
order to characterize this quantitatively, starting in the next section our approach 
is to consider all samples S' C X of size m on which there exist hypotheses h G Tt 
that have a margin 7 > 0. We estimate the number of distinct sets Ah generated 
hy h G H where for a fixed h, Ah consists of all samples S G S such that 
ds{h) > 7. Since for a fixed h, any S G Ah may be used to learn h to within the 
same error bound of Theorem 2 (with 1 — (5 confidence) then Ah essentially is a 
set of ‘equivalent’ samples of size m. 

4 Hyperconcepts 

The space S consists of all samples S' C X of size to. On S consider sets of the 
form 

4/37 “ ^ ^ ■ d,s{h) > /?}, (3 > 0 

where the dependence on to is omitted in this notation. We refer to indicator 
functions on S which take the form 

/r'^,/,(S) = 1(4/37) 

as hyperconcepts and we sometimes write just h'. 

For any fixed margin parameter 7 > 0 define the hyperclass 

h; = {h'^^h --hGU}. (1) 

In words, Ttiy consists of all sets of samples S C X of cardinality to on which the 
corresponding hypotheses h (there may be several hypotheses associated with 
the same set) have a margin of at least 7. 

Considering 7, to and H as given and fixed but allowing the possible training 
sample S' G S to vary then the quantity log represents the description length 
of any hyperconcept h' G TL'.y and is thus a measure of the richness or complexity 
of the hyperclass TL'.y. Alternatively stated, it is the complexity of indexing the set 
t' (corresponding to the target t) which consists of 7-good samples each of which 
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may be chosen to train and learn t and yield an error which is bounded as in 
Theorem 2^. Note that by definition of the log \H'^ \ decreases with increasing 
7 which is also intuitive from a learning perspective since larger-margin samples 
produce better error-bounds hence are implicitly more informative about t so 
there are fewer possible hypotheses and hence fewer possible sets of samples 
that induce them via learning. Our interest in this paper is to obtain an explicit 
dependence of this complexity on 7 and m. 

The following main result of the paper gives an estimate of this dependence. 

Theorem 3. Let B he a finite positive real number. For any 1 < m < |Jf| and 
7 > 0, consider a class H of real-valued functions from X to the bounded interval 
and let 1 < fat^ififK) < |Jf| — m. Then 



log < fat..^ I ie{H) log 



( 'If 

V ifatj/ieiH) J \ 72 




-b2. 



The next section contains the technical work of the proof of this theorem. 



5 Proof of Theorem 3 



Viewing Ti.'y as a class of boolean concepts on S then, in general, it may have 
a limited separation ability over any hypersample C,m = , . . . , with 

€ S, 1 < j < N and 1 < V < |S|. The growth function introduced 

by [11], captures its separation ability. It is defined as: 



= maxilH:,(Cw) = max 






(2) 



Since S is finite, then viewing S as a set fr with r = jSj we have the following 
simple bound: < 7 T- 7 y^^(|S|). Our approach is therefore to upper bound the 

growth function U-u'^{N). 

Let V be a positive integer and consider any hypersample C,n = , 

c S. Denote by 5'^^ the point in the sample based on the ordering 
of the elements of S^d) (which is induced by the ordering of the elements in V). 
Then 



( 3 ) 



I 


I ( min \h{x) - B/2\ - 7 ^ , 




:hen\ 


1 


\xeSO) J 

1 “ 


\xesm J\ 


J 


1 


1=1 

1 

to 

1 


m 

...,ni(|/z(5fVi?/2|-7) 


: h e H J 



® In this respect the samples in t' are equivalent so we are interested in counting not 
them but the number of hyperconcepts. 
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where we used the fact that the only way the minimum of the functions exceeds 7 
is if all functions exceed it. Order the elements in each set of Cat by the underlying 
ordering on X. Then put the sets in lexical ordering starting with the first up 
to the element, so for instance, if = 3, m = 4 and 

Cs = { {X 2 , xs, Xg, a:io}, {xg^ X5,xs, Xg}, {a; 3 , X%, a;io, xis}} 

then the ordered version is 

{{X 2 , X5,Xs, Xg}, {xg, Xs, Xg, a;io}, {^ 3 , Xg, Xig, X 13 }}. 

Let fV be a positive integer and consider the mapping G : ^ S'^ which takes 

a hypersample Cw and produces a hypersample Cat consisting of non-overlapping 
sets. It is defined as follows: 

Procedure G: 

Given a hypersample Cat construct Cat as follows: Let . For any 2 < 

i < N, let 

gii) ^ gii) \ IJ 5(fc), 
fc=i 

Let N be the number of non-empty sets 

Note that N may be smaller than N since there may be a set in Cat which 
is contained in the union of other sets in Cat. Leaving the empty sets out we 
henceforth let denote the new hypersample. It is easy to verify by induction 
that the sets of are mutually exclusive and their union equals that of the 
original sets in Cat. 

For any point a; G X let 

^lix) = l{xe{z: \h{z) - B/2\ - 7 > 0}) 

or simply 9h{x). Let 

m 

esO){h) = Y[eh{sf), 

with 

(^) = [es(D (M. ■ • ■ . es(N) (h)] 
where for brevity we sometimes write v{h). Let 

VniCN) = {vcnW :h€H} 

or simply y(^Af)- Then (4) becomes |VH(CAf)l- We have the following: 

Claim 1 For any Cn C S, \Vn{CN)\ < \Vn{G{CN))\ ■ 

Proof We make repetitive use of the following: let A, B C X he two non-empty 
sets and let G = B \ A. Then for any h, any b G {0, 1}, if [eA{h),eB{h)] = [b, 0], 
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then [eA{h),ec{h)] may be either [6,0] or [6,1] since the elements in B which 
caused the product eB{h) to be zero may or may not also be in C. In the other 
case if [e^( 6 ,), es(/i)] = [ 6 , 1] then [e^( 6 ,), ec( 6 .)] = [6,1]. Hence 

\{[eA{h),eB{h)] : h G H}\ < \{[eA{h),ec{h)] : h G H}\ . 

The same argument holds also for multiple Ai, . . . , B and C = B\ IJ^i Ai. 



Let Cat = G{C,n)- We now apply this to the following: 

|{[cs(i) {h)i 65 ( 2 ) ( 6 -), 65 ( 3 ) {h ), . . . , CgiN) {h)] : h G 7f}] 

= |{[es(i)(^)>®S(2)(^):es(3)(6,),...,es(iv)(/i)] :6 ,g7i:}| (4) 

^ |{[es(i)(^)>es(2)(^),es(3)(6,),...,es(iv)(/i)] :6,G7f}| (5) 

^ |{[esd)(M>e5(2)(/i),es(3)(/i),es(4)(/i)...,es(iv)(/i)] -.h&nW ( 6 ) 

< • • • 

— I { [®sG) (^)> ®s( 2 ) (^)j 65 ( 3 ) (^)j 65 ( 4 ) {h), . . . , Cg(N) (/i)] : h G 7i} I (7) 



where (4) follows since using G we have 5*-^^ = (5) follows by applying the 

above with A = B = and C = ( 6 ) follows by letting Ai = 

A 2 = S^'^\ B = and C = Finally, removing those sets which are 
possibly empty leaves fV-dimensional vectors consisting only of the non-empty 
sets so (7) becomes | { [eg(i) (h), . . . , eg(w) (/i)] : h G H] \ . □ 

Hence (4) is bounded from above as 

ilH;(Cw)<|VH(G(Cw))|. ( 8 ) 

Now, for a positive integer N* , define a mapping Q : ^ S'^ as follows: 

Procedure Q: 

Given a hypersample Qn ■ Construct Cat* as follows: Let Y = X \ and let 
the elements in Y be ordered according to their ordering on X (we will refer to 
them as y\, j/ 2 , • ■ ■)■ Let For 2 <i < jJfj — to -I- 1, let = yt-i 

and = S[*+\ 1 < j < m — 1 - 

Henceforth denote by N* = j j — to -I- 1. We now have the following: 

Claim 2 For any I < N < jSj, and any (n Q 

I^h(g(Cw ))1 < iyH(G(g(Cw)))l 

Proof. Let Cn = G{Q{(n)) and as before fpj = G(CAf)- Note that by defini- 
tion of Procedure Q, it follows that CiV is a hypersample having N = N* non- 
overlapping sets. They consist of a single set of cardinality to and sets 
2 < i < N, each of which contains a single element of X. Moreover, their union 
Ufci This holds for any and 1 < N < jS]. 

Consider the sets Fh(Cjv)’ Th(Cjv) denote them simply by V and V. For 
any v G V consider the following subset of H, 

B{v) = {h GB. '■ v{h) = {)} . 
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We consider two types of z) G F : The first does not have the following property: 
there exist functions ha, hj 3 G B{v) with 9~l {x) for at least one point 

a; G X. In this case, we have for all h G B{v) 

= esd)(/i) 

and for 2 < j < fV with kj ranging over the indecies of all sets C we 
have 

eg^kj)(h) = Og(j){h). 

Hence it easily follows that 



|VB(C)(C/f)l = I^B(ii)(Cw)|- 

Let the second type of v be the complement, namely, there exist functions ha, 
h /3 G B{v) with 0'l^{x) yf least one point x G X . If such x exists 

only in then the same argument as above holds and we still have 

I^B(C)(C/f)l = I^B(ii)(CAr)|- 

If however there is also such x in some ‘2 < j < N , then there exists some 
kj such that c and 



e§(.kj){ha) yf eg(kj){hfj) 

due to the sets S^'^\ 2 < z < iV, all being singletons. Hence for this second type 
of i) we have 

I^B(C)(Cw)l > I^B({i)(CAr)|- (9) 

Combining the above, then (9) holds for any v G V. 

Now, consider any two distinct Va, vp G V. Clearly, B{va) H B{v/j) = 0 since 
every h has only one unique v{h). Moreover, for any ha G B{va) and hb G B{v)j) 
we have v{ha) yf v{hb) for the following reason: there must exist some set 
on which ha and hb differ (since Va y^ v/j). If z = 1 then they must differ on 
Else if 2 < z < IV, then they differ on some set C where 2 < j < N. 
Hence no two distinct Va, Vf 3 map to the same v. We therefore have 

l^(Cjv)| = i^B(-o)(Cjv)i 

vev 

< E i^B(«)(c^)i (10) 

vev 

= |1"b(Czv)I 

where (10) follows from (9) which proves the claim. □ 

Note that the dimensionality of the boolean vectors in the set Vh{G{Q{(n))) 
is N* which again is |X| — m + 1, regardless of the cardinality N of Cat. This 
follows by definition of Procedure Q. 
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Let us denote by any hypersample obtained by Procedure Q followed by 
Procedure G, namely, 

Cat. = 

with any set C X of cardinality m and 

= where Xi, e X \ S*^^\ k = 2,...,N*. 

Hence we may now write 

max max iT«' (C at) < max max iPw (G(0(CAf)))l (11) 

l<Af<|S|CivC§ ~ 1<N<\S\CnCS' 

< max |PH(CAr*)l (12) 

Cat* 

where (11) follows from (4), Claim 1 and Claim 2 and (12) follows by definition 
of Cat* • Now, 



|14r(CAf*)l = l{[es*(i)(Mj ■ • ■ 1 eg.(iv*)(ft.)] : h G H}\ 

— 2 |{[es*(2) (h), . . . , eg. (AT*) (h)] : h G 7f}| (13) 



where (13) follows trivially since eg.(i)(h) is binary. So from (12) we have 



max max U-h' (Cn) < 2 max |{[ec.(2) (h), . . . , eg.(Ar*) (h)] : h G 7f}| 
l<Af<|S|CArCS ^ Cat* 



< 2 max 






(14) 



where x\, . . . , xn*-i run over any N* — 1 points in X. Fix any subset Xat*-! = 
{a;i, . . . ,XAr*-i} C X. We henceforth denote 

G^(X^._i) = \{[9l{x^), 91{xn^.^)] ■.h&n}\. (15) 

We proceed to bound C-y{XM*-i)- 

For any real number u define a quantization function 



Qj{u) = 7 



u 

1 



For a function h G H the function Qj{h{x)) maps from X into the finite subset 
= {0,7,27 ,..., [H/ 7 J 7 } of [0,H]. We denote by QjiH) the function class 
{Qj{h) : h G H}. Consider the restriction of Qj{H) on Xat*-! and denote it by 
An with functions ah G An mapping Xj\i*-i into Z^. 

For any subset Y C let denote the space of real- valued functions on Y . 

For any / G let the loo(l^)-norm of / be defined as ||/||f = max^gv |/(a^)|- 
For any class F of functions on Y let 7Vl(e, F, loo{Y)) be the packing number, i.e., 
the size of the maximal e-separated set in F with respect to the ^oo(F)-norm. 
Let the uniform packing number be defined as 



M{e,F,n) = ma,x{M{e, F,Iod{Y)) : Y, |P| = n}. 
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Every pair of distinct elements ai,02 G An satisfy ||ai — q;2||f >7 since they 
must be different on at least one point x €Y and on x their values a\{x) and 
a2{x) are restricted to the set Zj. Hence An is itself a maximal 7-separated set 
and therefore 

\An\ = M{'^,An,loo{AN*-i))- (16) 

Note also that 

C^{Xn^-i) < \An\ (17) 

since given any h G H, for x G Xn*-i with O'Kx) = 1 then ah{x) in general 
can take one of several possible values in Zj. Hence it now suffices to bound 
M{j,An,N*-l). 

Denoting by dj( = i&t-^{An) then for n > dyi we have from Theorems 12.1 
and 12.8 in [1] that 

/ n2\ log(eSn/( 7 (iA)) 

M{%-f,An,n) . (18) 

This result holds for a general class of real-valued functions^ (not necessarily 
with a discrete finite range as for An)- 
Now, if 7 < 2e then 



< fate_7/2(H) (19) 

since suppose Qj{H) e-shatters a set {x^, . . . ,xn} C X with translate vector 
r = [ri, . . . , rfc] G Then for all v G {—1, 1}^ there is a function fy gH with 

Q~fifvixij)) - Tj > e if = 1 
Q-f{fv{xij)) - Tj < -e if Vj = -1 

f ^ j ^ k. It follows that 

fv(xi^)-rj>e ifvj = l 

fv(xi^) - Tj < -e + j if Vj = -1 

or equivalently 

fv{xij ) -Xj - 7/2 > e - 7/2 if Uj = 1 
fv{xt^) - Vj - -f /2 < -e + j /2 if Uj = -1, 

f < j < k, so H {e — 7/2)-shatters {x^, . . . ,Xi^.} with a translate vector r = 
[n-h7/2, ...,rfc+7/2]. 

Hence continuing from (19) and letting e = 7 then 
d-A = fat-y{Q.y{'H)) < fat..y/2(7f). 

^ Related to this, for a finite VC-dimension class H of binary-valued functions with 
a suitable definition of a functional margin, [7] generalized Sauer’s Lemma [6] and 
estimated the cardinality of F-y C H, where F-y consists of all h G FI having a margin 
ds{h) > 7 on a set S' C N. In [8] this quantity is shown to have a sharp threshold 
behavior with respect to the margin parameter. 




208 Joel Ratsaby 



So (18) becomes 



/ n 2 \ fat^/2CH) log(eBn/(7fat^/2CH))) 

M{%-i,An,n) <2y-^j . (20) 

Letting n = — 1, from (16), (17) and (20) it follows that if — 1 > fat-y/2(7f) 

then 



/16(A^* — fat^/i6CH)log(8eB(Ar*-l)/(7fat^/i6CH))) 

C-),(Arjv»_i) < 2 f \ . (21) 

Together with (2), (14), (15) and recalling that N* — 1 = \X\ — m, we therefore 
have 

16(1 Al| — m)B‘^ \ log(8eS(|X|-m)/(7fat^/i6CH))) 

7^ / 

Now \H'^\ is clearly bounded from above by Uu' (|§|) hence we have 

This proves Theorem 3. □ 

Note, the function fat-y() is non-increasing with 7 hence since the first factor 
in the bound dominates the first log factor then for all 7 > c, for some constant 
c, log \H'^\ is non-increasing with increasing 7 and decreasing with increasing m. 

6 Conclusions 

Existing data-dependent learning-error estimates indicate that larger-margin 
samples can be worth more for learning and hence implicitly convey more infor- 
mation about the target. In this paper we posed the question of determining the 
complexity of such good samples. We introduced a new notion of a class of sets 
of samples which we denote as hyperclass and then obtained an estimate of its 
cardinality as a function of the margin parameter 7 and sample size m. The log 
of this quantity is our measure of the complexity of such 7-good samples. 



max U-H' (N) < 4 
1<AT<|S| ^ “ 
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Abstract. This paper studies the on-demand broadcasting problem with 
deadlines. We give the first general npper bound and improve existing 
lower bounds on the competitive ratio of the problem. The novelty of 
onr work is the introdnction of a new job scheduling problem that allows 
cancellation. We prove that the broadcasting problem can be rednced to 
this scheduling problem. This reduction frees us from the complication of 
the broadcasting model and allows us to work on a conceptually simpler 
model for upper bound results. 



1 Introduction 

This paper revisits the on-demand broadcasting problem with deadlines. Im- 
proved lower bound and upper bounds are presented. In particular, the upper 
bounds are based on a reduction of the broadcasting problem to a job schedul- 
ing problem that allows cancellation. This reduction allows us to work on a 
conceptually simpler model for improved upper bounds. 

On-demand broadcasting: The input is a sequence of page requests for a 
broadcasting server. These requests arrive at arbitrary times. The pages re- 
quested have possibly different length. At any time, the server can broadcast 
only one page and preemption is allowed. A unique feature of this problem is 
that all pending requests for a page can be satisfied with one single broadcast. 
Previous work on the broadcasting problem mainly focused on minimizing the 
total or maximum flow time [1,3-5, 10, 11]. Broadcasting with deadline has not 
been studied until recently [7, 8] . Here each request has a deadline, and no profit 
is gained if a page is delivered beyond the deadline. The objective is to maximize 
the total (weighted) page length of the requests served by their deadlines. Kim 
and Chwa [8] studied the kind of preemptive broadcasting in which a preempted 
page will be re-broadcasted from the beginning (intuitively, this model does not 
assume that all preempted clients to have buffered a partial page properly) . This 

* This research was supported in part by Hong Kong RGC Grant PolyU-5172/03E. 

** This research was supported in part by Hong Kong RGC Grant HKU-7045/02E. 
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version of broadcasting problem will be referred to as Broadcasting in the rest of 
this paper. Kim and Chwa gave a lower bound of min{v^, r} on the competi- 
tive ratio of any on-line algorithm for Broadcasting, where A is the ratio between 
the length of the longest and shortest page, and r is the maximum number of 
requests for a page arriving at a time. For the special case when A = 1, i.e., 
all pages have the same length, they showed that a heuristic based on highest 
profit is (3-|-2-\/2)-competitive. Finding an upper bound for the general case was 
left as an open problem. More recently, Kalyanasundaram and Velauthapillai [7] 
considered a stronger preemption model in which the server can resume broad- 
casting of a preempted page at the point of preemption and they also showed 
some interesting lower bound and upper bound results. 

Summary of results: In this paper we study broadcasting with deadline and 
focus on the preemption model used by Kim and Chwa (i.e., a preempted page 
will be rebroadcasted from the beginning) . Our first result is two improved lower 
bounds on the competitive ratio. We show that even if r = 1, no online algorithm 
for Broadcasting is thus improving the previous lower bound of min{v^, r} 
to simply ^/A. For the case Z\ = 1, we give a new lower bound of 2.59. This 
paper also contributes new results on the upper bounds. In particular, a (4Z\-f 3)- 
competitive algorithm is given. Note that this is the first performance guarantee 
for the general case. Further, when Z\ = 1, a more stringent analysis shows that 
the algorithm is 5-competitive, improving the previous upper bound of 3 -I- 2\/2. 
The following table summarizes all the above results. 





Upper Bound 


Lower Bound 


General A 


4A + 3 


• Va 

• minjV^, r} [8] 


A = 1 


• 5 

• 3 -h 2 V 2 [8] 


2.59 



The improvement on the upper bounds of Broadcasting is based on a reduc- 
tion to a new job scheduling problem that allows cancellation. This job schedul- 
ing problem is conceptually simpler, allowing us to obtain a better analysis of 
the highest-profit heuristic. This reduction result should be contrasted with the 
observation by Kalyanasundaram and Velathapillai that existing upper bound 
results on the traditional scheduling problem do not hold for the broadcasting 
problem [7]. 

Scheduling jobs with possible cancellation: Online deadline scheduling is 
a classical real-time problem [2,6, 9]. In the traditional model, users can release 
jobs at arbitrary times for scheduling on a single processor. These jobs have 
different processing times, profits and deadlines, and the processor can process 
one of these jobs at a time. The objective is to maximize the total profit of the 
jobs completed by their deadlines. To obtain a relation with Broadcasting, we 
consider a variant of job scheduling in which every job has an indefinite deadline 
but a user can cancel the job while waiting for processing. Denote this variant 
as JS-Cancellation. In fact, JS-Cancellation is natural in real-life applications. For 
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example, schedulers for printing jobs usually allow a user to cancel a printing 
job. Note that following the setting of Broadcasting, preemption is allowed and 
a preempted job will be restarted (instead of resummed at the point of preemp- 
tion). 

We say that an online problem P is reducible to another online problem Q, 
denoted as P ^ Qi if th® following is true: Given any online algorithm for Q, we 
can construct an online algorithm for P with the same competitive ratio. It is 
easy to prove that if P ^ Q, then any upper bounds (on the competitive ratio) 
for Q implies the same upper bounds for P. The main result in this paper is 
that Broadcasting -< JS-Cancellation. 

Motivated by the above reduction, we work on upper bound results for 
JS-Cancellation. We show a refined analysis of the highest-profit heuristic; this 
leads to a (4Z\' -I- 3)-competitive algorithm for JS-Cancellation, where A' is the 
ratio between the processing time of the longest and shortest job. Together with 
the relationship Broadcasting ^ JS-Cancellation, we give the first upper bound for 
broadcasting in the general setting. Furthermore, when Z\' = 1, the competitive 
ratio can be improved to 5. 

Organization of the paper: The rest of the paper is organized as follows. 
In Section 2, we give some definitions and notations. In Section 3, we present 
lower bound results for Broadcasting. In Section 4, we present upper bound 
results for JS-Cancellation. In Section 5, we present the reduction of Broadcasting 
to JS-Cancellation. 

2 Definitions and Notations 

In this section, we give the definitions and notations needed for our discussion. 
The input for Broadcasting is a sequence cr of page requests. Every page P has 
a length £{P). Every page request R has an arrival time a{R), a deadline d{R), 
a weight w{R), and a name of a page p{R) that it asks for broadcasting. We 
say that R is alive at time t if (1) a{R) < t < d{R) and (2) it has not received 
the complete p{R) yet. We say that it is healthy at time t if it is alive at t and 
t + £{p{R)) < d{R). Note that if R is healthy at t, then we can broadcast p{R) at 
time t and R can still receive the complete p{R) before its deadline. Given any 
schedule S for a, its profit p{S) is the total weighted page length of the requests 
served by their deadlines, i.e., p{S) = ^n^-fiw{R)£{p{R)) where TZ is the set of 
requests served by their deadlines. 

The input for JS-Cancellation is a sequence cr of jobs interleaved with cancel 
requests. Every job J has an arrival time a(J), a processing time £{J) and a 
profit p{J). A cancel request C has an arrival time a{C) and the name of a job 
j{C) that it asks for cancellation. Given any schedule S for a, its profit p{S) is 
the total profit of jobs completed. 

3 Lower Bound Results for Broadcasting 

In this section, we give a lower bound on the competitive ratio of any online 
algorithm for Broadcasting. Recall that A is the ratio between the length of the 
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longest and shortest page, and r is the maximum number of requests for a page 
arriving at a time. Our results show that even if r = 1, no online algorithm is 
better than v^-competitive. Note that r = 1 means that no two requests for 
the same page arrive at the same time. 

Theorem 1. No online algorithm for Broadcasting is better than \/A-eompeti- 
tive. This holds even if r = 1. 

Proof. For the sake of simplicity, we show a lower bound of [v^J . By a refine- 
ment of the argument, we can prove a lower bound of \/A and the details will 
be given in the full paper. 

Let a = [V^\ ■ Assume that there are two pages, Pq and P\ whose length 
are and a, respectively. Given any online algorithm A, we construct a sequence 
of requests as follows. At time 0, a request R for Pq arrives with deadline at time 
a^, i.e., it has a tight deadline. Requests for Pi arrive in phases. There are at 
most phases. In general, for 0 < z < — 1, the z-th phase starts at time ia 

and contains a requests arriving at consecutive time units whose deadlines are 
all at time (z -I- 2)a — 1. If A broadcasts P\ in this phase, no further requests 
arrive after this phase; otherwise, we enter the next phase. Note that no further 
requests arrive after the (a^ — I)-th phase. 

Now we analyze the performance of A against the optimal offline algorithm O. 
There are two cases. (1) If A broadcasts Pq to serve R, then O will satisfy all 
requests for Pi by broadcasting Pi at the end of each phase, i.e., at time (z -I- 
\)a — 1 for the z-th phase. There are altogether o? phases, each with a requests. 
Therefore, p{0) = ■ a = a^. Thus, p{0)/ p{A) = jcA = a. 

(2) If A broadcasts Pi in phase z, only the a requests released in this phase 
can be satisfied, all other previous requests are either not alive or not healthy. 
Thus, p{A) < a-a = In this case, O broadcasts Pq and satisfies the request R. 
Therefore, p{0)/p{A) > o? jo? = a. As a result, no online algorithm is better 
than Qf-competitive. 

For the special case with constant page length, i.e., Z\ = 1, we have a lower 
bound result that no online algorithm can achieve a competitive ratio better 
than 2.59. 

Theorem 2. No online algorithm for Broadcasting with A = 1 has competitive 
ratio better than 2.59. 

Proof. Consider two sequences of requests, X = {xq,xi, . . .) and Y = {yo,yi, 

. . .), each request is on a distinct page of unit length. All requests are with 
tight deadlines. Arrival times of xq, xi, . . . are 0,1,.. ., respectively. Arrival times 
of yo,yi, ■ ■ ■ are 0.5, 1.5, . . ., respectively. The weights of the requests follow a 
series T of non-negative numbers i/'o j V'l j • ■ • such that ifo = 1 , V'l = c,tp 2 = — 1 , 
and ifi = c{tpi-i — tpi-s) for z > 2, where c > 1 is a constant. The weights of 
requests xq, xi,X 2 , . . . are ipo, ip 2 , 4>a, ■ ■ ■, respectively, and the weights of requests 
yo, yi, 2 / 2 , • ■ • are z/ji, z/js, z/^s, . . ., respectively. 

The adversary works as follows. If the online algorithm A satisfies a request j, 
no further requests arrive. If j is Xk for some k, p{A) = ip 2 k- In this case, the 
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optimal offline algorithm O satisfies requests yo, . . . , yk, thus p{0) = V’2i+i- 

If j is yk for some k, p{A) = tp2k+i- In this case, O satisfies requests xq,. . . , Xk+i, 
thus p{0) = fp 2 i- It can be shown by mathematical induction that both 

Yi=o'^ 2 i+i/^ 2 k and Yi=o '^ 2 i/i’ 2 k+i are c. Hence if A satisfies a request j, 
then A is c-competitive. We want to find the maximum value of c such that the 
series S' contains a non-positive element. That implies that there is a request 
j without profit. A must not switch to that request and hence A will satisfy a 
request prior to request j. It can be verified that if c is less than or equal to 2.59, 
'P contains a non-positive element. 



4 Upper Bound Results for JS-Cancellation 

In this section, we present an algorithm, called GREEDY, for JS-Cancellation and 
analyze its competitive ratio. Recall that A' is the ratio between the processing 
time of the longest and shortest job. We consider the general case in which A' 
is arbitrary and the special case in which A' = 1. We first define the online 
algorithm GREEDY as follows. 

When the first job arrives or when a job completes, GREEDY schedules 
a job with the highest profit (ties are broken arbitrarily). When a job 
is being processed and a new job arrives, GREEDY will preempt the 
current job and start the new job if the new job carries at least twice 
the profit of the current job. 

We now analyze the performance of GREEDY. Let Sq and So be the set 
of jobs completed by GREEDY and the optimal offline algorithm OPT, respec- 
tively. For any set of jobs S, let p{S) be the sum of the profit of all the jobs 
in S. We prove the competitive ratio by constructing a mapping / to map each 
job in So to some job in So such that for every job J G So, the total profit 
of all jobs in So that map to J is at most (4Z\' -|- 3) times the profit of J. 
More precisely, let F{J) denote the set of jobs J' such that /(J') = J. We 
will show that for any job J G Sq, p{F{J)) < p{J) ■ (4Z\' -|- 3). As a result, 

P(So) = Yj^So Pinj)) < Yj^So PiJ) • (4^' + 3) = (4^' + 3) • p{So). The 
following theorem states the competitive ratio of GREEDY for JS-Cancellation. 

Theorem 3. GREEDY is (4Z\' + i)- competitive for JS-Cancellation. 

Proof We first define the mapping as follows. For each job J G Sq, define a 
sequence of jobs Qj = {Ji, J 2 , ■ ■ ■ , Jk) with Jk = J such that for 1 < i < 
k — 1, Ji is preempted by during the execution of GREEDY. (Note that 
Ji, J 2 , • ■ • , Jk-i may not be in Sq-) For all 1 < t < fc, we let = J. Notice 
that for 1 < z < /c - 1, 2p{Ji) < p(J*+i), hence, p{Qj) < Z)i<*<fc < 

2p( J). The mapping / is constructed as follows. For any job J G SoP So, we set 
f{J) = J. For any job J G So — So, suppose when J starts being processed by 
OPT, GREEDY is processing a job J', then we set /(J) = Note that J' 

must exist, otherwise GREEDY will schedule to process J. 
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Consider any job J € Sg- We claim that for every job Ji G Qj, during the 
time interval li that GREEDY is processing Ji, the profit obtained by OPT is at 
most 2A'p{Ji). Note that in the time interval li, OPT may be processing some 
jobs whose start time or finish time is outside li but we only count the profit 
obtained in li. 

Let r be the processing time of Ji. Suppose that in the time interval h, 
OPT has processed jobs Si, . . . , with processing time t\, . . . ,Tm and profit 
pi, . . . , Pm, respectively. We have (ti -I- • . . + Tm) = t. By the definition of A', we 
have min(ri, . . . , Tm) > t j A' . Combining these two constraints, we have A' > m. 
In the execution of GREEDY in the interval li, none of the jobs 
preempts the job Ji. Therefore, by the definition of GREEDY, px < 2p{Ji) for 
all 1 < a: < m. In other words, max(pi , . . . , pm) < 2p{Ji). Hence, pi -I- ... -I- 
Pm < 2mp{Ji) < 2A' p{Ji). Recall that p{Qj) < 2p{J). The sum of the profit 
of F{J) within the interval is at most < 4Z\'p(J). 

Notice that OPT may be still processing an un-completed job when GREEDY 
completes J, and this job has a profit at most 2p{J). Furthermore, F{J) may 
contain J and OPT may process J outside the interval Ui<i<fe^i- ^s a result, 
p{F{J)) < 4A' p{J) + 2p{J) + p{J) = p(J) • (4Z\'-|-3). Hence, Fhe theorem follows. 

Now, we consider the special case A' = 1. We can see that GREEDY is 
7-competitive by substituting Z\' = 1 to Theorem 3. Yet, by a more stringent 
analysis of GREEDY for A' = 1, GREEDY is shown to be 5-competitive. Notice 
that A' = 1 means that all jobs require the same processing time. 

Theorem 4. GREEDY is 5-competitive for JS-Cancellation when A' = 1. 

Proof. The proof uses a similar framework as that of Theorem 3. For any job 
in So, we define the same mapping / as before. Consider any job J € So ■ 
Define the sequence Qj = {J\, J^, ■ ■ ■ , Jk) as before. Because the processing 
time is the same for every job, there are at most k jobs other than J, say 
J[, J 2 , . . . , J^, in So mapping to J such that when J' starts processed by OPT, 
GREEDY is processing Jp, for 1 < p < fc. Since Jp does not preempt Jp, we 
have p{Jp) < 2p{Jp). Furthermore, F{J) may contain the job J itself. Therefore, 

^ P{J) + ^i<p<k P(’^p) — Pi'^) F X)i<p<fc “ p{F) + ‘^p{Qj) S 

5p{J). Therefore, the theorem follows. 

5 The Reduction 

In this section, we describe a reduction from Broadcasting to JS-Cancellation. 
Precisely, suppose that we have an online algorithm A for JS-Cancellation with 
competitive ratio c. We show how to construct from A an online algorithm B for 
Broadcasting with competitive ratio at most c. 

Roughly speaking, B works as follows. From the input sequence (Tb of page 
requests for Broadcasting, B creates a sequence aj of jobs and cancel requests for 
JS-Cancellation. Whenever B creates any job or cancel request, it passes it to A 
immediately for scheduling. B monitors the behavior of A. If A takes any action 
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such as starting a job, B takes some corresponding action such as broadcasting 
a page. When creating the jobs or cancel requests, B makes sure that at any 
time t and for every page P, if there are healthy requests i? with p{R) = P, 
then there will be a single job J for JS-Cancellation such that ^(J) = i{P) and 
P{J) = ^ReH w{R) ■ ^(P), where H is the set of healthy requests. Thus, if both 
A starts job J and B broadcasts P at some time t and there is no preemption, 
then both A and B receive the same profit and the two tasks finish at the same 
time, namely t + i{P). 

To get an idea on how the competitive ratio is preserved, let Sj and Sr be 
the schedules decided by A and B, and let Oj and Os be the optimal offline 
schedules for aj and (jb, respectively. We will prove below that 

1. p{Sj) = p{Sb) (i.e., the profit of the two schedules are equal) and 

2. p{Ob) < p{Oj). 

Recall that A has competitive ratio c. Then, we have < p\sjj — '"> 

hence the competitive ratio of B is no more than c. 

Before giving the details, we need some definitions. For any page P, let 
H{P,t) be the set of page requests R that are healthy at time t and p{R) = P. 
When B creates a job J at t, its profit and length will be set according to some 
healthy page P at t. To remember this relationship, we define g{J) = P. 

Now, we summarize the actions of B. At any time t, for any page P, the 
following events may occur. 

— A page request R with p{R) = P arrives. 

— A healthy page request R with p{R) = P becomes not healthy. 

— A broadcast of the page P has been preempted. 

— The page P has been broadcasted completely. 

In all these cases, B first creates cancel requests to delete all waiting jobs (i.e., 
not being processed) associated with P. More precisely, for each waiting job J 
with g{J) = P, B creates a cancel request C with j{C) = J. Then, if H{P,t) 
is not empty, B creates a job Jo with g{Jo) = P, ^{Jo) = ^{P) and its profit 
p{Jo) = w{R) • £{P). 

Note that all the jobs and cancel requests created will have arrival time t. 
Recall that B will also monitor A to see how A schedules the jobs. When A 
starts or restarts a job J, then B broadcasts the page g{J). When A preempts a 
job J, then B preempts the current broadcasting, which must be for page g{J). 
Below, we analyze the profits of the schedules. 

Lemma 1. p(S's) = p{Sj). 

Proof. By how B creates jobs and cancel requests, it is clear that at any time t, 
every waiting job J has the property that 

p(j)= ^ w{R)-e(p). 

ReH{g{J),t) 
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The profit of scheduling J at time t is equal to the profit of broadcasting 
page g{J) at time t. For every job J, A schedules J at time t if and only if B 
broadcasts g{J) at time t. Thus, we have p{Sj) = p{Sb)- 

Lemma 2. p{Ob) < p{0,j). 

Proof. We prove the lemma by constructing a job schedule S for aj such that 
p{Ob) < p{S) and p{S) < p{Oj). Then we can conclude that p{Ob) < p{Oj). 
Notice that Oj is the optimal schedule for aj, thus, we have p{Oj) > p{S). 
The schedule S is constructed to satisfy a property, which leads to the proof of 
p{Ob) < p{S). Consider the set of requests TZb and TZq served by Sb and Ob, 
respectively. S will have the property that p{S) = X)fi6(7?,oU7?,B) 

Then, we have p{S) > J2 r(^tzo = p{Ob)- 

Now, we show how S schedules jobs in aj and prove that it satisfies the prop- 
erty mentioned above. Note that the requests that are served hy Sb correspond 
to a set of jobs in aj that are scheduled by Sj. In other words, there are no 
cancel requests in aj that cancel these jobs; S must be able to schedule these 
jobs and obtain at least a profit of w{R)i{p{R)). 

We then consider the requests that have been served by but not by Sb- 
Suppose that k pages, Pi,P2, - ■ ■ ,Pk are broadcasted hy Ob such that Pi is 
broadcasted at time for 1 < i < k. Notice that all requests in TZq ~ R-b that 
are served by Ob at time ti are healthy, thus, there is a job R G aj at time ti 
such that g{Ji) = Pi- We make S to schedule R at time ti, thus, obtaining a 
profit of the corresponding requests. Therefore, S can obtain a total profit of 
'^RR'R.o-'R.b) w{R)£{p{R)). Combining with the profit obtained from jobs that 
are scheduled by Sj, the total profit of S equals to X)b6CRou7?,b) w{R)£{p{R)) . 
Hence, the lemma follows. 

By the above two lemmas, we have the following theorem. 

Theorem 5. Given a c-competitive online algorithm for SS-C^nceWsthon, we can 
design an online algorithm that is c-competitive for Broadcasting. 



References 

1. Y. Bartal and S. Muthukrishnan. Minimizing maximum response time in schedul- 
ing broadcasts. In Proceedings of the Annual ACM/SIAM Symposium on Discrete 
Algorithms, pages 558-559, 2000. 

2. S. Baruah, G. Koren, D. Mao, B. Mishra, A. Raghunathan, L. Rosier, D. Shasha, 
and F. Wang. On the competitiveness of on-line real-time task scheduling. Real- 
Time Systems, 4:125-144, 1992. 

3. J. Edmonds and K. Pruhs. Multicast pull scheduling: when fairness is fine. Algo- 
rithmica, 36:315-330, 2003. A preliminary version appears in SODA 2002. 

4. J. Edmonds and K. Pruhs. A maiden analysis of longest wait first. In Proeeedings 
of the 15th Annual Symposium on Discrete Algorithms, pages 811-820, 2004. 

5. B. Kalyanasundaram, K. Pruhs, and M. Velauthapillai. Scheduling broadcasts in 
wireless networks. Journal of Scheduling, 4(6):339-354, 2001. A preliminary version 
appears in ESA 2000. 




218 



Wun-Tat Chan et al. 



6. B. Kalyanasundaram and Kirk Pruhs. Speed is as powerful as clairvoyance. Journal 
of the ACM, 47(4):214-221, 2000. 

7. B. Kalyanasundaram and M. Velauthapillai. On-demand broadcasting under dead- 
line. In Proceedings of the 11th Annual Annual European Symposium on Algo- 
rithms, pages 313-324, 2003. 

8. J.H. Kim and K.Y. Chwa. Scheduling broadcasts with deadlines. In Proceedings of 
the 9th Annual International Conference on Computing and Combinatorics, pages 
415-424, 2003. 

9. G. Koren and D. Shasha. An optimal on-line scheduling algorithm for over- 

loaded uniprocessor real-time systems. SIAM Journal on Computing, 24(2) :318- 
339, 1995. 

10. K. Pruhs and P. Uthaisombut. A comparison of multicast pull models. In Pro- 
ceedings of the 10th Annual European Symposium on Algorithms, pages 808-819, 
2002 . 

11. S. Muthukrishnan Swarup Acharya. Scheduling on-demand broadcasts: New met- 
rics and algorithms. In The Fourth Annual ACM/IEEE International Conference 
on Mobile Computing and Networking, pages 43-54, 1998. 




Maximization of the Size and the Weight 
of Schedules of Degradable Intervals 



Fabien Bailie, Evripidis Bampis, and Christian Laforest 

LaMI, CNRS UMR 8042, Universite d’Evry, 

Tour Evry 2, 523, Place des Terrasses 91000 Evry, France 
{fabien. bailie , bampis , laf orest}@lami . univ-evry . f r 



Abstract. We consider the problem of scheduling a set of intervals with 
controllable processing times in the presence of release dates and dead- 
lines on a set of identical parallel machines in order to maximize either 
the number (the size problem) or the total weight (the weight prob- 
lem) of accepted intervals. We call these jobs degradable intervals. We 
stndy a special case, called the immediate case, in which each accepted 
interval has to start at its release date. We call the general case the non- 
immediate case. For both criteria, we present optimal algorithms for the 
immediate case. For the non-immediate case, we prove the NP-hardness 
of the size problem and present a 1.58-approximation algorithm. For the 
weight problem, we propose a 2.54-approximation algorithm. 



1 Introduction 

In this paper, we are interested by the network access reservation problem. As- 
sume clients wishing to reserve for a time the access to one of the k channels of a 
link managed by a provider. Due to congestion, the provider may not satisfy all 
clients. In such situations, he can degrade the service or reject some users. His 
goal is either the maximization of the accepted (even with degradation) connec- 
tions or the weight (or profit) of the accepted connections. This situation can be 
seen as a scheduling problem that we describe now. 

The model. The problem that we consider in this paper consists in scheduling 
a set cr of degradable intervals, on k identical machines. Each interval Gi is 
represented by a release date > 0, a maximal processing time pi > 0, a 
deadline di = ri + pi, a minimal processing time 0 < Qi < Pi and a weight 
Wi > 0. We consider two variants: The immediate model where an accepted 
interval has to start at his release date and the non-immediate model in which 
an interval may start at any date between ri and di. An interval is acceted if 
it is scheduled (without preemption) on one of the machines of the system at 
or after date and before date di, during a time U such that qi < ti < Pi. 
When an interval is accepted from date t\ to date t 2 , the machine is busy for 
all t\ <t<t 2 . An interval is said to be degraded if ti < Pi, otherwise, it is non- 
degraded. Given a schedule O we denote by N{0) its size (i.e. the number of 
scheduled intervals) and by W{0) its weight, which is the sum of the over 
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all the accepted intervals. Given an interval instance cr let N*{a) (resp. FF*(cr)) 
be the maximum number (resp. maximum weight) of intervals of a which can 
be scheduled. In the sequel, if A is a polynomial-time algorithm, we denote by 
Pn (resp. pw) the approximation ratio for the size (resp. weight), satisfying for 
all interval instances <t, pnN{A{(t)) > N*{a) (resp. pwW{A{a)) > W*{a)). 

Related works. When degradation is not allowed, maximizing the size of an 
interval schedule can be done in polynomial time using the algorithms of Faigle 
et al. in [8]. Maximizing the weight of a non-degradable schedule can also be 
done in polynomial time using the algorithms of Carlisle et al. in [6] and Arkin 
et al. in [1]. Note that in the non-degradable intervals case, the size is a sub- 
problem of the weight (by letting all the weights to be one) . Clearly, this is not 
the case for degradable intervals since a degraded interval could have a weight 
less than one unit. In task scheduling problems, there is a time window [ri,di] 
greater than the processing time pi, in which each scheduled interval must be 
executed for exactly a time pi. The problem of scheduling non- degradable tasks 
{di > ri -\-pi) has been studied by Bar-Noy et al. [5] and it is shown that both 
problems are NP-hard and approximable within a ratio near to 1.58 for the 
size problem and near to 2.54 for the weight problem. In [2], we studied the 
simultaneous maximization of the size and the weight. In [3] and [4], we studied 
the case where the intervals require more than one machine to be executed. We 
also analyzed in [3] the impact of the degradation of the number of machines 
alloted to intervals on the two criteria. 

Our contribution. In Section 2, we show how the immediate and the non- 
immediate scheduling problems can be reduced to the non degradable intervals 
{di = ri~\-pi and qi = pi) or tasks {di > ri -\-pi) scheduling problems for the size 
maximization. This implies that the immediate scheduling problem can be solved 
in polynomial-time and the non-immediate scheduling problem is NP-hard and 
can be approximated within a ratio of 1.58 with the algorithm of Bar-Noy et al 
[5]. 

In Section 3, we study the weight objective. We show that the immediate 
scheduling problem can be solved in polynomial-time. We present an algorithm, 
inspired by [6], exploiting the fact that the intervals finish their execution at 
special dates (either a release date or their own deadlines), which transforms the 
problem into finding the minimum cost flow of size k on an associated directed 
acyclic graph. For the non-immediate scheduling problem, we propose a 2.54- 
approximation algorithm for the special case where the weights are equal to the 
maximum execution duration for each intervals {wi = pf). 

2 Size Maximization 

In this section, we study the size maximizationproblem , i.e. the problem of 
maximizing the number of scheduled intervals. We show that the problem can 
be transformed into a non-degradable interval scheduling problem (solvable in 
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polynomial-time, see [1,6,8]) for the immediate model and into a task schedul- 
ing problem (NP-hard, see [5]) for the non-immediate model. All the results of 
this section are consequences of the following lemma which shows that for both 
variants there is an optimal solution executing each interval during the minimum 
amount of time. Note that since we are studying the size, we do not mention the 
weight Wi of the intervals in this section. 

Lemma 1. For the degradable interval scheduling problem, in both models, there 
exists an optimal schedule for the size in which all the scheduled intervals are 
fully degraded (every scheduled interval Ui is executed for a time qi). 

Proof. By contradiction, assume the existence of a schedule O* with an optimal 
size having at least one of its intervals scheduled for a longer time than qi and 
accepting more intervals than Od, the schedule with maximal size in which all 
intervals are fully degraded. Let O'* be the schedule obtained by degrading all 
the intervals of O*. Clearly, O'* is valid and accepts the same number of intervals 
as O* so N{0'*) > N{Od). This contradicts the optimality of Od- □ 



2.1 Immediate Scheduling Problem 

We solve in polynomial time the size problem with this algorithm: 



Transform the instance a of degradable intervals = {ri,pi,qi,di) into an in- 
stance o' of non-degradable intervals cr' = {r'^ = ri,p'^ = qi,d'^ = Vi + qi). 

Apply an optimal algorithm (see [1,6,8]) on o'. 



Theorem 1. For the immediate model, the above algorithm returns an optimal 
schedule for the size in polynomial- time. 

Proof. Optimality follows from Lemma 1 and the fact that the intervals have to 
start at their release dates. As the instance transformation is polynomial and all 
the cited algorithms are polynomial, our algorithm is also polynomial. □ 

2.2 Non-immediate Scheduling Problem 

Here, we show that maximizing the size in the non-immediate model is NP-hard 
because it contains the non-degradable task {di > ri pi) scheduling problem. 
We show that any algorithm with an approximation ratio for the task scheduling 
problem is also an approximation algorithm for our problem. So, the algorithm 
of Bar-Noy et al. [5] is also a 1.58-approximation algorithm for this problem. 

Theorem 2. In the non-immediate model, maximizing the size is NP-hard. 

Proof. By Lemma 1, there is an optimal solution executing all the intervals 
for a time qi. Therefore, one can transform any input instance / into an in- 
stance of tasks It by the following transformation: For each degradable interval 
(Tj = {vi,pi, qi, di) add the task ct' = (r' = rj,p' = q^, d' = di) to It. The inverse 
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transformation (from any set of non-degradable tasks to a set of degradable in- 
tervals) can also be done easily. A schedule of It on k machines is then equivalent 
to a schedule of / on fc machines. Since maximizing the size of a schedule for 
tasks is NP-hard (see [5]), our problem is also NP-hard. □ 

Theorem 3. For the size criterion, if A is an a -approximation algorithm for 
the task scheduling problem, then A is also an a- approximation for the non- 
immediate model. Consequently, a 1.5S- approximation algorithm exists. 

Proof Let a be an instance of degradable intervals, that is a set of at = 
(ri,pi,qi,di). Let a' be the task schedule instance build from cr as follows: For 
each (Ti = {ri,pi,qi,di), add the task tr' = (r' = r*,p' = qi,d[ = di). From 
Lemma 1, we get N*{a) = N*{a') and the first point of theorem follows. 

Bar-Noy et al. in [5] approximate the size of task schedules within a ratio of 
1.58. The ratio is then the same for the non-immediate scheduling problem. □ 

3 Weight Maximization 

Here, the maximization of the weight objective is studied. For the immediate 
model, we present in Section 3.1 an optimal algorithm. Indeed, we reduce the 
problem to a mincost flow problem, which is solvable in polynomial time (see [7] 
or [9]). In [6] and [1] such a reduction has been used in order to show that the non- 
degradable interval scheduling problem is polynomial. For the non-immediate 
model, we propose in Section 3.2 a 2-approximation algorithm for the single 
machine case and proportional weights and we extend it, using Theorem 3.3 of 
Bar-noy et al. [5], to a 2.54-approximation for the multiple machines case. 

3.1 Immediate Scheduling Problem 

We will exploit the following property showing that no gain can be obtained by 
creating idle times due to over-degradation. 

Property 1 Let a be an instance of degradable intervals and O* be a schedule 
of a with an optimal weight in the immediate model. The completion time of any 
interval of O* is either its own deadline or a release date of another scheduled 
interval. 

Proof. By contradiction, suppose the existence of an optimal schedule for the 
weight O* such that an accepted interval aj is degraded and complete its exe- 
cution at a date which is neither a release date nor its deadline. Let Cj be its 
completion time and be the interval that directly follows aj on the machine 
i where aj is executed. Let be the release date of cr^; rz is also the date at 
which cTj; starts its execution, thus Cj < rz, and i is idle between these two dates. 
Two cases must be considered if az exists: If rz > dj, let O'* be the schedule 
executing the same intervals at the same dates as O* except for aj which is 
enlarged until dj . If < dj , let O'* be the schedule executing the same intervals 
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at the same dates as O* except for aj which is enlarged until Vz- If (Tz does not 
exist, let O'* be the schedule executing the same intervals at the same dates 
as in O* except for aj which is enlarged until dj. In both cases, O'* is valid 
and W{0'*) > W{0*) because of the increase of the execution time of aj. This 
contradicts the optimality of O*. □ 

In spite of the infinity of execution alternatives of an interval, the decision 
points are deadlines and release dates. In the sequel, we call significant all sched- 
ules having for completion times of all the executed intervals either release dates 
or deadlines. Note that, by Property 1, all optimal schedules are significant. 

We present now a polynomial time optimal algorithm for the weight maxi- 
mization in the immediate model. We transform the problem into a mincost flow 
problem by constructing a directed graph from an interval instance. We show 
that this graph has a polynomial number of nodes. Then, we show that at every 
mincost flow of size k, we can associate a valid optimal significant schedule. To 
finish, we show how to build this schedule in polynomial time. 

Transformation of an instance of intervals into a directed graph 

Let (j be an instance of degradable intervals, R = : (Ji = (ri,pi,di,Wi) 6 cr} 

be the set of the release dates, and Hi = {ri}U {r ■. r e R, rt + qi < r < di}U{di} 
be the set of significant dates of Ci S cr. 

Creation of graph G{(t) = [V, E), where V (resp. E) is the set of vertices (resp. 
arcs). 

Vertices of Each vertex « of V is associated to a date date(u). 

— Create vertices S and T with date(S) = —oo and date{T) = +oo. 

— s-type vertices: For each date r a R, create a vertex s with date{s) = r. 

These |iJ| vertices are indexed following the order of their associated dates 
(if i < j then date(si) < date(sj)). 

— M-type vertices: For each interval and for each date d in create a 
new vertex associated with date d. These |f?i| vertices «i,i, ■ ■ ■ , Ui,|X 2 j 

are indexed following the order of their associated dates: 
ri = date(ui^i) < • • • < = di. 

Arcs of G(<y): Each arc is associated to a capacity and a weight. 

— Create the |iJ| + 1 arcs (5, si), (si, S 2 ), ■ ■ • , (s|_r|, T) all with capacity k and 
weight 0. 

— For each interval di create the following arcs: 

• For J = 1, ■ ■ ■ , |l?i| — 1: Create the arcs with capacity 1 and 

weight — (date(«ij+i) — date{uij))t^. 

• Create the arc with capacity 1 and weight 0 with 1 < /9 < |iJ| 

and date{sf}) = date(ui^i) = r^. 

• If there exists a date r in ij such that di = date(ui ^Q^^) < r, then let 

Sj (date(s~/) G R) be the vertex such that date{s~f) = min{date(r) : 
di < date(r),r G i?}, and create the arc Sj) with capacity 1 and 

weight 0. Otherwise, add the arc with capacity 1 and weight 

0. 

• For j = 2,3, •• -,112^1 — 1, create the arc {ui^j,sp), 2 < (5 < \R\ with 
dateisj}) = date{uij) with capacity 1 and weight 0. 



Let us show that the mincost flow of size k of any graph built by the trans- 
formation algorithm yields to an optimal k machines schedule for the weight, 
and that this can be done in polynomial time. 
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Lemma 2. The number of nodes produced by the algorithm is 

Proof The total number of nodes created by the algorithm is 2+|i?|+^^.g^ 
where |i?| is the number of distinct release dates and X)cr go- 1^*1 represents the 
number of nodes created for the n intervals. The additive factor 2 comes from S 
and T. Note that |i?| < n because in the worst case, all intervals have distinct 
release dates. Let us upper bound go- \ ^i\i number of nodes generated 
from the intervals. For each cTj, \ f2i\ nodes are created. But \fii\ < |i?| + 1 and 
since there are n intervals, Xo-iGo- \ ^i\ ^ 0{n^). The lemma follows. □ 

Lemma 3. The transformation algorithm returns an acyclic graph. 

Proof. Suppose by contradiction that there is a cycle in a graph produced by 
the algorithm. By construction, when an arc is added between two vertices u 
and V, we have date{u) < date{v). Furthermore, for all intervals ai G cr, we have 
1 17^1 > 2. By construction, there is no cycle containing only s-type vertices or 
containing only w-type vertices. If a cycle is possible in the graph, it contains both 
u-type and s-type vertices. No arc (uij,Uk,i) where i ^ kis present in the graph. 
Furthermore, to reach a u-type node from any s-type node one has to visit a node 
Ui^i. But this node cannot have an s-type node as a successor. Thus, the only 
possible cycles contain a path of at least two u-type nodes between two s-type 
nodes. Let Ui^i and Uij, j > 2, be the extreme vertices of these M-type nodes path, 
and let s„ and sp be respectively the predecessor of Uip and the successor of Uij. 
By construction, we have date(sa) = date{uip) < date{uij) = date(sp). Thus 
since Sq is in a cycle there must exist an arc (X, Sa) with an elementary path from 
sp to X. Thus, date(X) < date(sa) < date(sp) < date{X)\ A contradiction. □ 

Given any path p from S to T, one can extract a valid significant schedule 
with the following algorithm: 

Extraction of a schedule from a path 

Let (7 be an instance of partially degradable intervals. 

Let Gin) be the graph constructed with the transformation algorithm. 

Let p be a path from S to T 

Let O be a current schedule, initialized to empty. 

For each arc (v, w) of p: 

if V and w are rt-type vertices, (v,w) = (rtij, schedule in O the 
corresponding “piece of the interval” tJi between date(uij) and date(ni,j+i) 

While there are two consecutive “pieces of interval” ai in O, merge these two 
pieces. 

Lemma 4. This algorithm yields a valid significant schedule on one machine. 

Proof. Let O be the schedule returned by the extraction algorithm. In order to 
prove the validity of O, we have to show that each accepted interval ai has a 
processing time at least qi and at most pi and starts at r^, secondly, all ai & O 
are executed continuously and finally, two scheduled intervals do not overlap. 

By construction of G{a), if ai is extracted, this means that p follows the 
arc (sp,Uip) with date(sp) = date{uip). Then, p follows thus the 
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algorithm executes the piece of interval ai between the dates date{ui^i) and 
date{ui^ 2 )- But, by definition, date{ui^i) = rt and date{ui^ 2 ) = min{r e R : 
fi + qi < r < di\ . Thus, Ui is executed at least during a time If p follows all 
Uij nodes then O schedules ai for a processing time equal to pi. Note that since 
Q(a) is acyclic (Lemma 3), no node Uij can be visited twice by p. This means that 
no interval can be executed for a time larger than pi . The only way for p to reach 
a node Uij in G{a) is to visit the node Ui^i. Thus, ai cannot be interrupted and 
executed later in O. The fact that there cannot be overlapping intervals follows 
from the fact that, by construction, all arcs (u,v) satisfy date{u) < date{v). 

It remains to show that O is significant. This is true because if an interval is 
executed, it starts at (because p follows Furthermore, it finishes either at 
a release date if it is degraded or at its own deadline (each node Uij is associated 
to a release date except the last one which is associated to di). □ 

Lemma 5. Let a be an instanee of degradable intervals, and G{a) be the asso- 
ciated directed acyclic graph. From a mincost flow of size k (from S to T), we 
can extract an optimal schedule having a weight equal to the opposite of the total 
cost of the flow. 

Proof. Let us consider a mincost flow F of size k in G{a). Since G{a) is acyclic, 
each unit of flow can follow a particular elementary path from S to T. 

Lemma 4 shows that for each unit of flow, we construct a valid significant 
schedule on a single machine. We have to show that the union of these one 
machine schedules is still valid on k machines and that the obtained schedule 
is optimal for the weight. For the validity, we just have to show that there is 
no interval scheduled twice on different machines. To make it possible 2 units of 
flow have to cross an arc (s/ 3 ,Uii)- But this is not possible because the capacity 
of this arc is only one. 

The weight of the corresponding schedule is the opposite of the cost of the 
flow. Assume by contradiction that there exists a significant schedule generating 
a weight greater than the opposite of the cost of F. Using the inverse technique 
of the extraction algorithm, one can find in G{a), k elementary paths from S 
to T. By adding one unit of flow on each arc among these k paths, we get a 
flow of size k with a lower cost than F. Contradiction. Our technique constructs 
the optimal significant valid schedule. A consequence of Property 1, is that an 
optimal significant schedule is also optimal for the immediate model. □ 

Theorem 4. A schedule with an optimal weight in the immediate model can be 
found in polynomial- time. 

Proof. We just have to transform any instance of degradable intervals into a 
graph using our algorithm, to compute the mincost flow of size k on this graph 
with one of the algorithms described in [7] or in [9], and to extract the optimal 
schedule according to the flow. All these steps can be done in polynomial-time. 
Lemma 5 ensures the optimality. □ 
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3.2 Non-immediate Scheduling Problem for Proportional Weights 

We study the maximization of the weight for the non-immediate model for the 
particular case where the weights are equal to the execution times {wi = Pi). We 
describe a 2-approximation algorithm for a single machine system (fc = 1). We 
extend it to a 2.54 approximation algorithm in the general case of k identical 
machines by a method used by Bar-Noy et al. in [5]. At this time we do not 
know whether the problem is NP-hard. 



A 2- Approximation Algorithm for a Single Machine System. Before 
describing our results we need some definitions. 

Definition 1. Let a be an instance of degradable intervals. The shadow of a 
is the set of dates (real numbers): Shadow{a) = Uo- : Vi < x < di\. 

We define the thickness ta{x) of any date x to be the number of intervals 
Gi of G such that Vi < x < di: ta{x) = \{Gi G g : Vi < x < di}\. 

The thickness t^ of an instance g is the maximal thickness of shadow (g): ta = 
max{t^{x) : x G Shadow{G)}. 

Clearly, any valid schedule (on one machine) is a set of intervals whose shadow 
has thickness equal to one. The thickness of any instance is at least 1 (if no 
intervals overlap) and is at most n = |cr|. 

The shadow is not necessarily composed by a single real interval. We call 
component each connected real sub-interval of the shadow, i.e. subsets Si of the 
shadow, maximal for inclusion satisfying: Va;,y G Si, if x < z < y then z G Si. 

Definition 2. Let g be an instance of degradable intervals and Si be any com- 
ponent of Shadow(G) that is a real interval of the form [6j,ej). The weight 
of any component Si is the value Ci — bi and the weight of the shadow is 
W{Shadow{G)) = '^Si 

Note that for any valid schedule O of g we have: W{Shadow{G)) > W{0). 

Lemma 6. Let g be an instance of degradable intervals. There exists a set 
g' G_ G such that: ta' < 2 and Shadow{G) = Shadow(G'). 

Proof. Let us consider the following algorithm. 

(j' := (j; 

While there exists a date x with (x) > 2 do 
Let a; be a date with ta>{x) > 2; 

Let C(x) = {(Ti : (Ti £ a' and ri < x < di} 

Let (7a be the interval of C(x) having the minimum release date. 

Let (Tj, be the interval of C(x) having the maximum deadline. 

Update cr' by removing all the intervals of C(x) except <Ja and Cf, (note that we may 
have (7a = (71,). 

This algorithm executes at most n = \g\ times the While loop. Each execu- 
tion strictly reduces the number of intervals of g' . At the end, we clearly have 
ta' < 2. Let us now prove that: Shadow{G) = Shadow(G'). As g' C g we have: 
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Shadow{a') C Shadow{a). Assume by contradiction that there exists a date 
y G Shadow{a) and y ^ Shadow{a'). This means that there is a particular 
step in the main While loop, examining date x, before which ta'{x) > 2 and 
y G Shadow(a') and after which t„i{x) < 2 and y ^ Shadow{a'). This While 
loop examines the date x and removes from a' all the intervals of C{x) except 
(7a and 7 b ■ By hypothesis, a consequence is that after the deletions we have: 
ta'{y) = 0. This means that y is neither a date of Ga nor a date of Gb- However, 
by the definition of Ga and Gb we have: Va < H < db but since Va < y < db and 
da < y (otherwise y is a date of Ga- Note that the date da is not a date of Ga) 
and y < rb (otherwise y is a date of Gb) we get: Va < da < y < Xb < db- This 
implies that Ga and Gb do not intersect and contradicts the fact that Ga and Gb 
must have the date x in common. The contradiction follows. □ 



Let G be any degradable intervals instance. Each degradable interval Gi = 
{ri,di,pi,qi,Wi = Pi) G g is transformed into a non degradable interval: g^ = 
{ri,di,pi,qi = Pi, Wi = Pi)- The set of these non degradable intervals is denoted 
G^ - Let A be the algorithm described in [1] (or in [6]) that constructs in poly- 
nomial time an optimal weight schedule from any instance of non degradable 
intervals {qi = pi)- Our algorithm ADN just consists in applying algorithm A 
to G^ - Note that the obtained schedule is valid and that the whole algorithm is 
polynomial. The next result shows that ADN is a 2-approximation algorithm. 



Theorem 5. Let g be an instance of degradable intervals, k = 1, O* be a maxi- 
mal weight degradable schedule of g and g^ be the non degradable instance asso- 
ciated to G - Then O, the resulting schedule of A on g^ , satisfies: W{0) > ^ . 



Proof By Lemma 6, there is a set g' C g such that: to-' < 2 and Shadow{G) = 
Shadow{G')- g' can be partitioned into 2 subsets: cr'(l) and g'{2) with tcr'(i) = 
^ct'( 2) = 1- This means that all the intervals of g'{1) (resp. cr'(2)) can be scheduled 
without degradation (between their release dates and their deadlines) on one 
machine. The sets cr'(l) and g'{2) can be considered as valid non degradable 
schedules of g'- W.l.o.g. assume that W{g'{\)) > W{g'{2))- 

Since A constructs an optimal weight non degradable schedule of g^ and as 
o'^(l) C G^ , we have: W{0) > W{g'{1))- Moreover, since W{g'{1)) > W(g'(2)), 
we have: W{g'{1)) > ^ But, ^(-'(i))+m^(A( 2)) > w(Shadon,(.'y) _ 

As g' satisfies: Shadow{G) = Shadow(G') (by Lemma 6), we deduce: 
w{Shadow(a )) _ w (Shadow{a)) ^ since W {Shadow{G)) > W(0*), we have: 

W{0) > 

Note that the bound is tight. Indeed, consider the following 4 intervals in- 
stance: Gi = (ri = 0,pi = 1, di = 1, gi = 1 — £, w\ = 1), ct 2 = (r2 = 1 — e,p 2 = 
1 -I- 2£,c?2 = 2 -I- £, 92 = 2,W2 = 1 + 2£), (73 = (rs = 2,P3 = 1,^3 = 3,93 = 
1 — £, W3 = 1), Gi = (r4 = 3 — £,P4 = 1 -|-2£, (i4 = 4-|-£, 94 = H-2£, Wi = 1 -|-2£). 
A schedules G 2 and <74 generating a weight of 2 -|- 4£. The optimal schedule con- 
sists in scheduling all the intervals degrading gi and G 3 generating a weight of 
4 -I- £. tends to 2 when £ tends to 0. □ 
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Multiple Machines Systems. We propose an algorithm A{k) for finding a 
schedule from an instance of degradable intervals a for k machines. A{k) uses as 
a subroutine ADN. Using similar arguments than Bar-Noy et al. in [5], we get 
Theorem 6. 



Algorithm A{k) 

Input: Any degradable intervals instance <t and k identical machines. 
For j := 1 to fc do 

Apply ADN on instance <t and machine number i; 

Remove from cr the previously scheduled intervals; 

Return the schedule composed of the k constructed sub-schedules; 



Theorem 6. For the degradable intervals scheduling problem on k identical ma- 
chines, A{k) is a ( 2 fc)*=-( 2 fc-i)'= -approximation. Note that ( 2 fc)*=-( 2 fc-i)'= — 2.54. 

Final Discussion and Future Work. The maximization of the size and the 
weight in the immediate model can be done in polynomial time. The size max- 
imization in the non-immediate model, is NP-Hard and the complexity of the 
weight problem is unknown. Nevertheless, we have proposed approximation al- 
gorithms for both metrics. In the future, we will focus on the NP-hardness of the 
weight problem in the non-immediate model. We will also seek to generalize the 
results presented here for the weight maximizaton in the non-immediate model 
to the case where the weights can take any values. 
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Abstract. We consider the problem of scheduling n jobs with release 
dates on m identical parallel batch processing machines so as to minimize 
the maximum lateness. Each batch processing machine can process up 
to B (B < n) jobs simultaneously as a batch, and the processing time 
of a batch is the largest processing time among the jobs in the batch. 
Jobs processed in the same batch start and complete at the same time. 
We present a polynomial time approximation scheme (PTAS) for this 
problem. 



1 Introduction 

In this paper, we consider the following batch scheduling problem that arises 
in the semiconductor industry. The input to the problem consists of n jobs and 
m identical batch processing machines that operate in parallel. Each job j has 
a processing time pj, which specifies the minimum time needed to process the 
job without interruption on any one of the m batch processing machines. In 
addition, each job j has a release date rj before which it cannot be processed 
and a delivery time qj. Each job’s delivery begins immediately after its processing 
has been completed, and all jobs may be delivered simultaneously. Each batch 
processing machine can process up to B (B < n) jobs simultaneously as a batch. 
The processing time of a batch is the largest processing time among the jobs 
in the batch. The completion time of a batch is equal to its start time plus its 
processing time. Jobs processed in the same batch have the same completion 
time, which is the completion time of the batch in which the jobs contained. 
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This model is called the hounded model, which is motivated by the problem of 
scheduling burn-in operations in the manufacturing of integrated circuit (IC) 
chips (See [1] for the detailed process). Our goal is to schedule the jobs on m 
batch processing machines so as to minimize max j{Cj + qj}, where Cj denotes 
the completion time of job j in the schedule. 

The problem as stated is equivalent to that with release dates and due dates, 
dj, rather than delivery times, in which case the objective is to minimize the 
maximum lateness, Lj = Cj — dj, of any job j. When considering the perfor- 
mance of approximation algorithms, the delivery-time model is preferable (see 
[2]). Because of this equivalence, we denote the problem as P\rj,B\L^ax, using 
the notation of Graham et al. [3]. 

Problems related to scheduling batch processing machines have been exam- 
ined extensively in the deterministic scheduling literature in recent years. Here, 
we give a brief survey on the previous work on the bounded problems with due 
dates. Lee et al. [1] presented a heuristic for the problem of minimizing maximum 
lateness on identical parallel batch processing machines and a worst-case ratio 
on its performance. They also provided efficient algorithms for minimizing the 
number of tardy jobs and maximum tardiness under a number of assumptions. 
Brucker et al. [4] summarized the complexity of the problems of scheduling a 
batch processing machine to minimize regular scheduling criteria that are non- 
decreasing in the job completion times. Along with other results, they proved 
that the bounded problems of minimizing the maximum lateness, the number 
of tardy jobs and the total tardiness on a single batch processing machine are 
strongly AP-hard, even if H = 2. Both [1] and [4] concentrated on the model 
of equal release dates. As for the model of unequal release dates, the known 
previous results were restricted on the case of a single batch processing machine 
[5-7]. Ikura and Gimple [5] provided an 0{n^) algorithm to determine whether 
a due date feasible schedule exists under the assumption that release dates and 
due dates are agreeable (i.e., < Uj implies di < dj) and all jobs have identical 

processing times. Li and Lee [6] proved that the problems of minimizing the 
maximum tardiness and the number of tardy jobs where all jobs have identi- 
cal processing times are strongly AP-hard even if release dates and due dates 
are agreeable. Wang et al. [7] presented a genetic algorithm to minimize the 
maximum lateness with release dates on a single batch processing machine. 

To the best of our knowledge, the general Pjrj, PjPniax problem has not been 
studied to date. In this paper we present a PTAS for this problem. Our study 
has been initiated by [8] and [9] . Deng et al. [8] presented a PTAS for the more 
vexing problem of minimizing the total completion time with release dates on 
a single batch processing machine. Hall and Shmoys [9] presented a PTAS for 
problem PjrjjLniax (the special case of our problem where B = 1). We draw 
upon several ideas from [8, 9] to solve our problem. 

This paper is organized as follows. In Section 2, we introduce notations, 
simplify our problem by applying the rounding method, and define small and 
large jobs. In Section 3, we describe how to batch the small jobs. In Section 4, we 
first show that there exists a (I -I- be)- approximate outline, a set of information 
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with which we can construct a (1 + 5e)-approximate schedule. We proceed to 
define the concept of outline and then enumerate over all outlines in polynomial 
time, among which there is a (1 + 5e)-approximate outline. Put together, these 
elements give us our PTAS for the general problem. 

2 Preliminaries 

We use opt to denote the objective value of the optimal schedule. To establish a 
polynomial time approximation scheme (PTAS), for any given positive number 
e, we need find a solution with value at most (1 + e) • opt in time polynomial in 
the input size. 

In this section, we aim to transform any input into one with simple structure. 
Each transformation potentially increases the objective function value by 0(e) • 
opt, so we can perform a constant number of them while still staying within a 
l + 0(e) factor of the original optimum. When we describe such a transformation, 
we shall say that it produces 1 + 0(e) loss. To simplify notations we will assume 
throughout the paper that 1/e is integral. For explicitness, we define the delivery 
time of a batch to be the largest delivery time among the jobs in the batch. We 
use p{Bi), d{Bi), S{Bi) and C{Bi) to denote the processing time, delivery time, 
start time and completion time respectively of batch Bi. We use Tmax(5') to 
denote the objective value of schedule S. We call a batch containing exactly B 
jobs a full batch, where B is the batch capacity. A batch that is not full will be 
called a partial batch. Let rmax = max^-r^, Pmax = max^p^, = max^g^ . 

The special case of P|rj, i?|Lmax where all = Qj = 0, denoted as P|P|Cmax, 
is already strongly AP-hard [1] (even for P = 1). Lee et al. [1] observed that 
there exists an optimal schedule for PlPlCmax in which all jobs are pre-assigned 
into batches according to the BLPT rule: rank the jobs in non-increasing order 
of processing times, and then batch the jobs by successively placing the B (or 
as many as possible) jobs with the largest processing times into the same batch. 

To solve the general version of the problem with release dates, we need to 
amend the BLPT rule: we may use this rule for all the jobs even though they 
are not released simultaneously, or use it only for a subset of the jobs. 

We use the BLPT rule for all the jobs and get a number of batches. Denote by 
d the total processing time of these batches. Then we have the following lemma. 

Lemina 1, max|rniaxjPmax 5 Qmax? ^ opt ^ ^niax “t” Pmax “t” Qmax “t” 

Proof It’s easy to see that opt > max{ rmax, Pmax, <?max, dfm}. We use the BLPT 
rule for all the jobs and get a number of batches. Starting from time rmax we 
schedule these batches by List Scheduling algorithm [10]: whenever a machine 
is idle, choose any available batch to start processing on that machine. Any job 
will be delivered by time rmax + Pmax + <Zmax + d/m in this schedule. □ 

Let 5 = e • max{rmax,Pmax, <Zmax, d/w}- We will use 6 this way throughout 
this paper. 

A technique used by Hall and Shmoys [9] allows us to deal with only a 
constant number of distinct release dates and delivery times. The idea is to 
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round each release date and delivery time down to the nearest multiple of 6. 
Since rmax < (l/e)<5 and qmax < {l/e)5, there are now at most 1/e + 1 distinct 
release dates, as well as l/e+ 1 distinct delivery times. Clearly, the optimal 
value of this transformed instance cannot be greater than opt. Every feasible 
solution to the modified instance can be transformed into a feasible solution to 
the original instance just by adding 5 to each job’s start time, and reintroducing 
the original delivery time. Since 5 < e ■ opt, the solution value may increase by 
at most a 1 + 2e factor. Therefore we get the following lemma. 

Lemma 2. With 1 + 2e loss, we assume that there are at most l/e+ 1 distinct 
release dates and l/e+ 1 distinct delivery times. 

As a result of Lemma 2, we assume without loss of generality that the release 
dates take on 1/e + 1 values, which we denote by px, p^, . . ■ , Pi/e+i, where pi = 
(z — 1)(5. We set Pi/e +2 = oo. We partition the time interval [0,oo) into l/e+ 1 
disjoint intervals of the form A = [pi, pi+i) (z = 1, 2, . . . , 1/e + 1). We assume 
that the delivery times take on 1/e + 1 values, which we denote by < ^2 < 

• • • < ■Cl/e+l- 

We partition the jobs (batches) into two sets according to their processing 
times. We say that a job or a batch is small if its processing time is less than 
<5/(l/e + 1)^, and large otherwise. We use Tu to denote the set of small jobs 
with Pi and £/ as their common release date and delivery time respectively, 
z= l,2,...,l/e+l; ; = l,2,...,l/e+l. 

By Lemma 1, we know that there are at most 4(l/e + l)^/e large batches 
processed on each machine in any optimal schedule. More precisely, on each 
machine, there are at most (1/e + 1)^ large batches started in interval U for 
z = 1, 2, ... , 1/e and at most 3(l/e+l)^/e large batches started in interval 
Though simple, this observation plays an important role in our algorithm since 
it allows us to show that the number of distinct outlines is polynomial in the 
input size. We defer the details till Section 4. 

The following lemma enables us to deal with only a constant number of 
distinct processing times of large jobs. 

Lemma 3. With 1 + e loss, the number of distinct processing times of large jobs, 
K, can be up-bounded by 4(1 + e)^/e^. 

Proof. We round each large job’s processing time down to the nearest multiple 
of e/4 • i5/(l/e + 1)^. Clearly, the optimal value of the rounded instance cannot 
be greater than opt. Since pj > i5/(l/e+ 1)^ for each large job j, by replacing the 
rounded values with the original ones we can transform any optimal schedule 
for the rounded instance into a (1 + e/4)-approximate schedule for the original 
instance. Since pj < (l/e)i5, we get k < 4(1 + e)^/e^, as claimed. □ 

Let Pi < P 2 < • • • < be the k distinct processing times of large jobs. 
We assume henceforth that the original problem has the properties described in 
Lemmas 2 and 3. 
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3 Batching the Small Jobs 

In this section, we describe how to batch the small jobs with 1 + 3e loss. The 
basic idea on which we rely is similar to that in [8] . 

For i = 1,2,..., 1/e + 1 and I = 1,2,..., 1/e + 1, we use the BLPT rule 
for all the jobs in Tu and get a number of batches, Bh , S/; , . . . , , such that 

Bh, Bfi, . . . , B^i'-~^ are full batches and q{B^i) > p{B^j^^), where p{B^i) and 
denote the processing times of the largest and smallest jobs in B^i, re- 
spectively. Then we have the following observation. 



kil 1 



^ (p(S'j) - g(S'j)) < 6/{l/e+lf 

h^l 



( 1 ) 



Let Bii = {^B^i ■. h = 1,2, ... We modify Bu to define a new set of 

batches Bu = : h = 1, 2, . . . , |, where B^i is obtained by letting all pro- 

cessing times in B^i be equal to qiB’li) for /i = 1,2, . . . ,ku — l and B^i' is obtained 
by letting all processing times in be equal to zero. 

Each original small job j is now modified to a new small job f , with pi 
(which is equal to the processing time of the batch that contains j'), rj and qj 
as its processing time, release date and delivery time, respectively. We call the 
jobs {pl,rj,qj : j = 1, 2, . . . , n} modified jobs, where pi = pj if j is a large job. 
We then define an accessory problem: 

BSl : Schedule the modified jobs {pl,rj,qj : j = 1, 2, . . . , n} on m identical 
parallel batch processing machines to minimize the maximum lateness. 

We use optl to denote the optimal value to BSl. Since pi < pj for all jobs 
J, we get optl < opt. We also observe the following fact. (Please contact the 
authors for the detailed proof if any reader is interested in it.) 



Lemma 4. There exists a schedule S for BSl with the objective value Lmax(<S') < 
optl + 2e • opt in which the set of the batches containing small jobs is just 

l/e+l (l/e+1 \ 



U U 



i=l 



1=1 



The following lemma enables us to determine the batch structure of all the 
small jobs a priori. 



Lemma 5. There exists a {l+3e)-approximate schedule S for P\rj, S|Lmax with 
the following properties: 



1/ e-l-l / 1/ £-1-1 \ 

1 ) the set of the batches in S containing small jobs is just IJ [J Bu \ ; 

i=i y 1=1 J 

2) the batches in S started in each interval on each machine are processed 
successively in the order of non-increasing delivery times. 



Proof. Consider a schedule for BSl that is described in Lemma 4. We trans- 
form it by replacing the modified jobs with the original jobs. By the previous 

inequalities (1), we have - P(^5)) + (p(-B')*') - 0) < 6/{l/e+lf 
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(z = 1, 2, . . . , 1/e + 1; I = 1, 2, . . . , 1/e + 1). Thus the solution value may in- 
crease by (5 < e • opt. As optl < opt, we get a (1 -I- 3e)-approximate schedule 
for -P|rj, -B|Linax in which the set of the batches containing small jobs is just 

l/e+l /l/e+1 \ 

U [j \. We then use the well-known Jackson’s rule [11] to transform 

i=i y 1=1 J 

this schedule without increasing the objective value: process successively the 
batches started in each interval on each machine in the order of non-increasing 
delivery times. Thus we get a schedule of the required type for P\rj, BjLniax- □ 

4 Scheduling the Jobs 

In this section we will design a polynomial time approximation scheme to solve 
problem Pjrj, BjLmax- 

By Lemma 5, we can pre-assign all the small jobs into batches and get 

l/e+l /l/e+1 \ 

U \J Bil ] in 0(n log n) time: use the BLPT rule for all the jobs in Tu 

i=i y 1=1 J 

(z = 1,2,..., 1/e -1-1; I = 1, 2, . . . , 1/e -I- 1). Thus we will restrict our attention 
to schedules of this form. 

Next we show that there exists a (1 J- 5e)~ approximate outline, a set of in- 
formation with which we can construct a (1 J- 5e)-approximate schedule. We 
motivate the concept from [9]. 

Let us fix a (1 J- 3e)-approximate schedule that is described in Lemma 5, 
S. Let Xiji be the total processing time of the small batches in S that are 
started in interval li on machine Mj with as their common delivery time. 
Let Xiji = [(1/e -I- 1)^ • Xiji/5'\ ■ 5/{l/e+ 1)^. That is, Xiji is the approximate 
amount of time in interval li on machine Mj that is spent processing the small 
batches with delivery time We then delete from S all the jobs and the small 
batches, but retain all the empty large batches. Let Vij^i be the set of the empty 
large batches in S that are started in interval li on machine Mj with Pk and 
as their common processing time and delivery time respectively. Recall that 
Pk denotes the fcth largest processing time among k {k < 4(1 + e)^/e^) distinct 
processing times of large jobs. We call the set {{Xiji,Yijki) ■ 1 < i < l/e-|-l,l< 
j < m,l < k < K,1 < I < 1/e J- 1} a (1 J- be)- approximate outline, since it can 
be used to construct a (1 J- 5e)-approximate schedule, as Lemma 6 below shows. 

Algorithm Ai 

Given a (1 J- 5e)-approximate outline, do the following: 

Step 1. Suppose that we have assigned small batches to intervals Ii to A-i. We 
describe the assignment of small batches to A. (z = 1, 2, . . . , 1/e J- 1.) 
For j = 1, 2, . . . , m and ^ = 1, 2, . . . , 1/e J- 1, we construct a set of small 
batches Ziji as follows: assign the small batches available in A with 
delivery time to Ziji, until the first time that p{Bh) > Xiji (or 

until there are no more available small batches with delivery time ^;). 
Clearly, if Xiji = 0, then Ziji = </• 
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Step 2. For i = 1, 2, . . . , 1/e + 1, j = 1,2, we stretch li to make an 

extra space with length 2i5/(l/e+ 1) and then start the small batches 
l/£+l K /l/e+1 \ 

in U Ziji and the empty large batches in IJ IJ Yijki ) as early 

1=1 k=l y 1=1 J 

as possible in on Mj in the order of non-increasing delivery times (by 
Lemma 5). 

Step 3. For ease of representation, we reindex arbitrarily the large jobs as x\, 
X 2 , ■ ■ ■■ Index the empty large batches arbitrarily as Bi, B 2 , ■ ■ ■■ By re- 
garding each large job Xg as a vertex in X, and each empty large batch 
Bh as B(the batch capacity) vertices Uhi,yh2, ■ ■ ■ ,UhB in Y, we con- 
struct a bipartite graph G with bipartition {X,Y), where Xg is joined 
to yhi,yh 2 , ■■■, yhB if and only if < S{Bh), Pxg < p{Bh) and < 
d{Bh). Then we use the Hungarian method [12] to get a matching of 
G that saturates every vertex in X, which corresponds to a feasible 
assignment of the large jobs to the empty large batches. 

Lemma 6. Given a {1 + be) -approximate outline, Algorithm A\ will find o (1-1- 
be)- approximate schedule in 0{{l/e-\- l)^m^ B^ / e^) time. 

Proof. Denote by S the (1-1- 3e)-approximate schedule from which the (1-1- 5e)- 
approximate outline is obtained. We will show that Algorithm A\ reconstructs 
S with 1 -I- 2e loss in 0((l/e -I- /e^) time. 

Clearly, the definition of plus the way to assign small batches to in- 
tervals, guarantees that every small batch gets assigned to some set We 
observe that for any i,j and I, ^ p{Bh) — Xiji < 2i5/(l/e-|- 1)^. We also ob- 

serve that there exists a matching of G that saturates every vertex in A(i.e., the 
assignment of the large jobs to the large batches in S). Therefore Algorithm Ai 
constructs a feasible schedule S' with Lniax(«S'') < Lmax(5') -I- 2(5 < (1 -|- 5e) • opt. 
Noting that a (1 -I- 5e)-approximate outline contains at most 4(l/e -I- Yfimje 
empty large batches, we get |A| < |F| < 4(l/e-|- YfimB je. Therefore it will take 
0((l/e-|- Yfirrt’B'^ je^) time to get a matching of G that saturates every vertex 
in X. Since the time complexity of Algorithm A\ is determined by Step 3, the 
lemma is proved. □ 

A possibility of a (1-1- 5e)-approximate outline is called an outline. We are 
going to enumerate over all outlines in polynomial time, among which there is a 
(1-1- 5e)-approximate outline. 

To do this, we define a machine configuration to be the restriction of an 
outline to a particular machine. Let us fix a particular machine, Mj. Recall that 
in any optimal schedule, on each machine, there are at most (1/e -I- 1)^ large 
batches started in interval A for i = 1,2, ... ,1/e and at most 3(l/e-|- l)^/e 
large batches started in interval /i/j+i. Combining the structure of an outline, 
we get the following observation. For i = 1, 2, . . . , 1/e, k = 1,2,. . . ,k and I = 
1, 2, . . . , 1/e, both the number of different possibilities of and that of Yijki 
are (l/e-|-l)^-l-l; while for 5 = l/e-|-l,A: = 1,2 ,...,k and I = 1,2, ... ,1/e, both 
the number of different possibilities of Xiji and that of Yijki are 3(l/e-|-l)^/e-|-l. 
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Therefore we can up-bound the number of different machine configurations by 
r < {[(l/e+ 1)2 + 1]1A+1 . [(l/e+ 1)2 + 1](1A+1)«}1A . {[3(i/e+ i) 2 /g+ i]i/.+i . 
[3(l/e-|-l)2/e-|-l]^^/'^''"^^”} < 2[(l/e-|-l)2-|-l]A/'=+i)(«+i)/<=, where k < 4(l-|-e)2/e"^. 

We denote the different machine configurations as 1, 2, . . . , _T. An outline can 
now be defined as a tuple {mi , m 2 , . ■ . ,mr), where is the number of machines 
with configuration i. Therefore there are at most (m -I- 1)^ outlines to consider, 
a polynomial in m. 

Given an outline, we can evaluate its objective value as follows. View each 
Xiji as an aggregated batch with processing time Xiji and delivery time 
respectively. For z = 1, 2, . . . , 1/e -I- 1 and j = 1, 2, . . . , m, we stretch A to make 
an extra space with length <5/(l/e-|- 1) and then start the aggregated batches 

K, ( 1/e+l 

U 

1 y 1=1 

early as possible in A on Mj in the order of non-increasing delivery times. The 
time by which all batches have been delivered is the objective value of the given 
outline. If some batch (an aggregated batch or an empty large batch) cannot be 
started in the specified interval, then the outline will be excluded. 

We are now ready to describe our algorithm, which constructs a feasible 
schedule from an outline with objective value as small as possible. 

Algorithm A2 

Step 1. Get all outlines, evaluate their objective values and exclude some of 
them as described above. 

Step 2. Invoke Algorithm Ai repeatedly to deal with the left outlines in the order 
of non-decreasing objective values until a feasible schedule is generated. 
(If some small batch or a large job cannot be scheduled and has to be 
eventually left, then the outline cannot generate a feasible schedule and 
will be excluded.) 

Step 3. Glean up the generated feasible schedule (delete the empty batches and 
move all the batches to the left as far as possible while keep them in the 
specified intervals) and then output it. 

Finally, we get the following theorem. 

Theorem 1 . Algorithm A2 is a PTAS for the problem P\rj, 



Xiji, Xij2, ■ ■ ■ , Xlj(^l/^_^.l'^ and the empty large batches in IJ 

k= 
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Abstract. We consider the problem of approximating a function on a 
d- dimensional voxel grid by a unimodal function to minimize the L 2 ap- 
proximation error with respect to a given measure distribution on the 
grid. The output unimodal function gives a layered structure on the voxel 
grid, and we give efficient algorithms for computing the optimal approx- 
imation under a reasonable assumption on the shape of each horizontal 
layer. Our main technique is a dominating cut algorithm for a graph. 



1 Introduction 

Given a geometric object, transforming it into an object satisfying a certain geo- 
metric property is a problem frequently discussed in computational geometry. In 
this paper, we consider the following problem: Consider a function f in d vari- 
ables as an input. We want to transform the function into a unimodal function 
/■p(a:) minimizing the L 2 error (with respect to a given measure distribution) 
under the condition that each horizontal slice {x : fv{x) < i} at height t has a 
good geometric shape. The computational complexity of the problem depends 
on the representation of input and output functions. For simplicity, we assume 
that the functions are defined on a voxel grid and investigate the complexity 
based on the number of voxels, although our framework works if / is defined 
on other types of subdivisions of such as triangulations by giving suitable 
modification. 

Precisely speaking, we consider a d-dimensional voxel grid F consisting of 
n = mi X m 2 X ... X md cells (voxels). We can assume mi = m for each 1 < z < d 
without loss of generality. We consider a function / : ^ M that is a constant 

function in each voxel c. We can assume that / is nonnegative without loss of 
generality. A grid region is a simply connected region represented as a union of 
voxels. We fix a family TZ of grid regions in T. Without loss of generality, we 
assume that 0 S 7^ and F G TZ. 

* This author was supported in part by the National Science Foundation under Grant 
CGR-9988468. 
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Definition 1. A pyramid is a series V of regions P(ti) (i = 0, 1, 2, . . . , /i) in 
TZ associated with a series of increasing nonnegative numbers {called heights) 
to < ti < t 2 <•■■< th satisfying that P{to) = P and the monotone property 
P{t) C P(t') fort> t'. 

The surface function f-p of the pyramid P is defined by fp{x) = ti li x & 
P{ti) — or equivalently, fp{x) = maxjti : x G P{ti)}. Because of the 

monotone property of the pyramid and simply-connectedness of regions in 7?,, 
fp is a unimodal function. 

Given a measure function p, : P —f K>o, the squared L 2 distance between 
fp and / is (11/ - fvlU)^ = ~ fv{x))^dx}. Since / is a 

constant in each voxel c, we can consider / as a function from T to M by defining 
/(c) = f{xc) where Xc is any representative point in each voxel c. Similarly, we 
define /-p (c) , and thus ( 1 1 / - /-p 1 1 2 ) ^ = Ecg r { M (c) ( / (c) - /-p (c) ) 2 } . We can further 
rewrite it as (||/ - /pI^)^ = E{P) = T,i=oT,ceP{t.)\Pit,+^)ific) ~ ^*)V(c)- 
The optimal pyramid approximating / is the pyramid (identified to its surface 
function) minimizing this distance. 

Although we consider a general measure function p since it is essential in 
some applications, the problem is very intuitive if p is the Euclidean volume 
function. See Figure 1 for an example where d = 1. The 1-dimensional problem 
is considered in [4], and a linear time solution has been given. If c? = 2, the 
optimal pyramid (Fig 2) can be considered as reshaping a terrain defined by / 
into a single-peak terrain defined by fp by moving earth from higher position 
to lower position minimizing the loss of positional potential (it will be shown 
in Lemma 1 that the total mass of the terrain is invariant). This seems to be a 
basic problem in computational geometry and geography, and some results are 
given by Chun et al. in [5]. 

Constructing an optimal pyramid in two or higher dimensions is a natural 
extension of the problem of region segmentation [2], and is useful in several 
applications in data mining, statistics, data visualization [1] geomorphology, and 
computer vision [3]. 




Fig. 1. A pyramid approximation (right) of a piecewise constant function / (left) for 
d= 1 



In this paper, we investigate the condition under which the optimal pyramid 
can be efficiently constructed. Our condition is defined by using dominating sets 
of a directed graph. We can flexibly control the size of the family of regions 
by using the graph. In particular, we define the following d-dimensional region 
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Fig. 2. Pyramid approximation (right) of a 2-dimensional terrain (left), where /i is 
constant and /(c) and /p(c) are represented by using the gray levels of pictures (black 
and white represent 1 and 0, respectively) 



families: (1) stabbed-union of orthogonal regions, (2) generalized base-monotone 
regions, and (3) digitized star-shaped regions. Surprisingly, for each of these 
region families, the optimal pyramid can be computed in polynomial time in the 
number n of voxels by reducing the problem to the minimum s-t cut problem in 
a directed graph. 

Readers may worry that the number n of voxels may become large especially 
if d is high (say, more than three). However, we may combine voxels to obtain a 
compact structure to represent the functions to speed-up our algorithm. 



2 Properties of the Optimal Pyramid 

For a grid region R in F, we define /(c) /i(c) and /r(i?) = 

If it will cause no confusion, we also write f{R) for p{R) / p{R), which is weighted 
average of / over R. 

Lemma 1. For the optimal pyramid, X)cer(/('^) ~ fv{c))p-{c) = 0. 

Proof. The formula is rewritten as J2i=i p{P{l'i)\Piti+i)) — ti-pt{P{ti)\P{ti+i)) 
= 0. It is well-known in statistics that in order to minimize 
EiR) = EiLoEcGP(ti)\Ppi+i)(/(c) ti must be the weighted average 

of /(c) over P{ti) \ P{U+i). This means that U = p{P{ti) \ P{U+i)) / p.{P{ti) \ 
P{t i+i)), and we have the lemma. 

We can consider that the input is the pair of p and p, instead of p and /. 
Introducing a nonnegative parameter t, gt{R, p, p) = p{R) — t • p{R) is called the 
parametric gain of p against p within the region R. 

Lemma 2. Consider a function P{t) from (0,oo) to TZ satisfying that P(t) C 
P{t') for t > t' and maximizing the objective function 

J{V) = JZoPiP(t))-t-p{P{t))dt. 

Then, it has at most n -I- 1 transition values 0 = t_i < to < ti <■■■< th 
satisfying that P{t) = P{ti) for t G {ti-\,ti] and P(t) = 0 for t > th- Moreover, 
V consisting of P{to),P{t\), . ■ . ,P{th) is the optimal pyramid. 
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Proof. Since there are n = voxels in P, there are at most n -I- 1 differ- 
ent regions in P{t) for t G (0,oo). Thus, it is obvious that P{t) has at most 
n -I- 1 transition values. Using Lemma 1, J{V) = p{P{t)) ~ t • p{P{f))dt 

= Elo itu_, pinu)) - 1 ■ p{p{u))dt = Elom - u-i) • p{p(u)) - - 

tt,yp{p{u)) = Elot^p{P{u)\p{u+i))-t^/2-p{p{ty\p{u+i)) = Eto^f/2- 
p{P{ti) \ P{ti+i)). On the other hand, using U = p{P{U) \ P(ti+i))/^(P(L) \ 
P(t*+i)), wehaveP('P) := E^i=oEc&p{u)-p(u+^)U{c]-tif-p{c) = EcerP^(c)/ 
p{c)+Y!l=o{-‘2U-p{P{U)-P{ti+i))+ti-^i{P{ti)-P{U+i))} = Ecer P^(c)/m(c)- 
E’l=o^l-p{P{ii)-P{ii+i))- Thus, E{V) = Ec(^pP^{c)/h{c)-2J{V). Obviously, 
Ec^p P^ i® independent of the choice of the pyramid V, and hence the 

function P{t) maximizing J{V) defines the pyramid V minimizing E(V). This 
proves the lemma. 

Observing J{P), intuitively, the optimal pyramid V is obtained by piling up 
horizontal sections P{t) with as large parametric gains as possible. Consider the 
region R°P*(t) in TZ maximizing the parametric gain gt{R, p, p). If t increases, 
R°P*(t) tends to shrink. However, unfortunately, if we pile up the maximum gain 
regions R°P{t), they do not always form a pyramid, since R°P*{t) C R°P*{t') 
does not always hold for t > t'. 

2.1 Algorithms for a General Family 

For d = 1, it is natural to consider the set of all integral intervals as TZ. It has 
been shown that the optimal pyramid for the one-dimensional problem can be 
computed in 0{n) time [4]. 

In a higher dimensional case (d > 2), the time complexity largely depends on 
the specific family TZ of regions. Indeed, it is not difficult to see that the problem 
of computing just a single fiat of the optimal pyramid is NP-hard for some 
families even for d = 2 [2]. If TZ has M different regions, the optimal pyramid 
can always be computed in polynomial time in M and n for the d-dimensional 
case. We construct a directed acyclic graph G = {TZ, E) whose vertex set is TZ. 
For each pair R and R' of TZ, we give a directed edge e = {R, R') if and only if 
R Z) R' . We compute t{e) such that p{R \ R') = t{e) ■ g{R \ R'). The value t{e) 
is called the height label of e, and r(e) = t^(e) • p{R \ R')/2 is called the profit 
of e. A directed path p = Cq, ei, . . . , is called admissible if t{ei-i) < t{ei) for 
i = l,2,...,q. The profit of an admissible directed path is the sum of profit 
values of the edges on it. 

Lemma 3. The optimal pyramid is associated with the admissible path with the 
maximum profit in G, such that R\ R' is a flat of the pyramid if and only if 
{R, R') is an edge on that path. 

Thus, we can reduce the optimal pyramid problem to a maximum-weight- 
path problem in the directed acyclic graph G. Note that each directed path in 
G has at most n edges. By using a dynamic programming algorithm, we obtain 
the following result: 
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Theorem 1. The optimal pyramid for TZ of M different regions can he computed 
in 0{M‘^n) time. 

Unfortunately, the above algorithm is seldom practical. For example, the 
family of rectangular regions has 0{n^) regions, and hence the above time com- 
plexity is O(n^). Moreover, for computing accurate layered region rules, we want 
to consider families in which M is exponentially large in n. Thus, we seek to de- 
velop more efficient algorithms for some special families of regions. 

3 Closed Family and Domination Closure of a Graph 

A discrete family TZ of regions in the d-dimensional space is called a closed 
family (under Boolean operations) if it is closed under the intersection and union 
operations, that is, RH R' G TZ and i? U i?' G 7^ for any two regions R, R' in TZ. 
The following proposition is easy to see: 

Proposition 1. Given a closed family TZ, let R°P*(t) be the region in TZ maxi- 
mizing gt{R, p, pi). If there are multiple regions in TZ maximizing gt{R, p, pi), we 
take any one which is minimal under inclusion. Then, the series of transitions 
of R°P*{t) gives the optimal pyramid 'P for TZ. 

We consider families TZ that can be constructed using closed families as their 
building blocks. 

3.1 Domination-Closures in a Grid Graph 

In the voxel grid T, we first guess the cell c = (ci, C 2 , . . . , Cd) giving the peak 
of the pyramid, and define a directed graph G(c), whose vertex set consists of 
all voxels in T. For voxels p = {p\,p 2 , . . . ,Pd) and q = {qi,q 2 , • ■ • , qd), the Li 
distance between p and q is dist{p,q) = X)i=i \Pi ~ Ii\- The neighboring cells of 
p are the cells whose distance from p is one. For a cell p and its neighbor q, a 
directed edge is defined: its direction is {p, q) (i.e., from p to q) if dist{p, c) = 
dist{q, c) -I- 1 and otherwise (q,p). The graph G(c) is a weakly-connected directed 
graph with d{m — 1) x = 0{n) edges (we assume d is a constant), and c is 
its unique sink vertex (i.e., a vertex without outgoing edges). 

A subgraph H = {V, E) of G(c) is called a rooted subgraph if there exists a 
directed path in H from each vertex u of U to c. 

Given a rooted subgraph H = (V, E) of G(c), we say that a vertex u is El- 
dominated by another vertex v if there exists a directed path from u to m in 
H. An H -domination closure IF is a subset of V satisfying the condition that 
every id-dominated vertex by a vertex in W is contained in W. Clearly, each 
id-domination closure defines a grid region containing the cell c in E, and we 
like to identify such regions. Given a rooted subgraph id of G(c), we consider 
the family TZh that is the set of all id-domination closures. Since the domination 
closure property is closed under union and intersection, we have the following 
proposition: 
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Proposition 2. For a rooted subgraph FI of G{c), TZh is a closed family of 
regions. 

3.2 Algorithms for Computing the Optimal Pyramid 
with Respect to TZh 

We assume that /r(c) and p(c) are quotient numbers of nonnegative integers less 
than U for every cell c G F. In other words, U is the precision and N = nil gives 
an upper bound of the bit size of the input. 

Let us fix a rooted subgraph H of G(c). We consider a parameter value t 
defining the height of a flat of the optimal pyramid with respect to TZh, and we 
give a weight p{p) — t ■ p,{p) to each voxel p (and its corresponding vertex in H). 
Due to the following lemma (proof is routine) , we can assume that t is a rational 
number with small denominator and numerator. 

Lemma 4. If t is a height defining a flat of the optimal pyramid for a certain 
region family, then t is a rational number represented by a quotient of two integers 
less than or equal to N'^. 

By definition, the maximum (parametric) gain region P°^*{t) € TZh is the 
7L-domination closure maximizing the sum of weights of voxels in the region. 
In graph-theoretic terminology, this is the maximum domination closure of the 
weighted directed graph H, and is obtained as the connected component of FT 
containing c by removing a set of edges (the cut set). See Fig. 3 for a region 
P°P*{t) in TZh (here, H = G(c) and d = 2) and the corresponding domination 
closure. The number in each pixel p of the left picture is the weight p(p) -t-p,(jp). 
The following theorem is due to Hochbaum [7] (also see [9]): 

Theorem 2 (Hochbaum [7]). Given a directed graph G with n vertices having 
real-valued vertex weights and m edges, its maximum domination closure can 
be computed in 0(T(n,m)) time, where T{n,m) is the time for computing a 
minimum s-t cut in an n-vertex, m-edge directed graph with nonnegative edge 
weights. 
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Fig. 3. An optimal region and its corresponding cut set (dashed arrows) in G(c) 



Theorem 3. The optimal pyramid for the family TZh can be computed in 
0(n^'® lognlog^ A) time. The time complexity is improved to OlnlogN) if H 
is a tree. 
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Proof. We apply binary search for finding all possible heights of flats in the set S 
of quotient numbers defined by all pairs of integers less than or equal to N. We 
call the process “binary branching search” since we And more than one height. In 
the binary branching search, suppose we have P°P*(to) and for to < ti. 

If P°P*{to) = we set the interval / = as inactive-, otherwise, it is 

active and there may be a flat whose height is within the interval. If the interval is 
active, let tor be the center (precisely, an element of S approximating the center) 
of {to,t\). Because the family TZh is closed, P°P*{ti) C P°^‘(toi) C P°P*{to). 
Thus, to compute P°P*{toi), we can remove all vertices outside P°P*{to) from P[ 
and contract the vertices within P°P*{ti) into a single vertex. Thus, it suffices 
to compute the maximum domination closure in a directed graph PI {I) with 
|P°^’*(to)| — |P°^*(ti)| vertices and the same order of edges (since the outdegree 
of each vertex remains to be at most two after the contraction) . 

The binary branching search consists of multiple levels, such that in a level, 
an active interval created at the previous level is split into two intervals. It 
is easy to see that the number of levels is O(logfV). In each level, the sum 
of the numbers of vertices of H{I) for all active intervals / is 0{n). We plug 
in T{n,n) = 0{n^'^ log n log N) [6], and hence the computation time for pro- 
cessing each level is 0(n^'® log n log A^). Therefore, the total time complexity is 
0(n^'® lognlog^ N). The improved time complexity for a tree is easy to obtain. 

The algorithm given in Theorem 3 assumes that we know the peak c of the 
optimal pyramid, since the graph P[ depends on c. If we do not know the peak, 
a naive method is to examine all possible n cells of P as the candidates of the 
peak of the optimal pyramid, and report the one optimizing the objective func- 
tion. This method takes 0{n^'^ log n log^ N) time. However, we can practically 
guess the peak by to reduce the number of candidates of the peak to <C n, 
assuming that / has a good pyramid approximation. We omit describing the 
method in this version. If the method works, we can And the optimal pyramid 
in lognlog^ N) computation time without knowing the position of the 

peak in advance. 



4 Algorithms for Typical Region Families 

4.1 Stabbed Unions of Orthogonal Regions 

For a fixed cell c of P, a region P in P is called a stabbed union of orthogonal 
regions at c if P can be represented as the union of orthogonal regions each of 
which contains c. The cell c is called the center cell of P. 

Fig. 4 gives an example of the two-dimensional case. The pyramid given in 
Fig. 2 is based on the family of stabbed unions at a point. Naturally, the center 
cell (or point) gives the peak of the pyramid. 

The following lemma is straightforward from the definitions of G(c) and TZh ■ 

Lemma 5. A region R in P is a stabbed union at c if and only if it is a G{c)~ 
domination closure. 
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Corollary 

nal regions 



1. The optimal pyramid for the family of stabbed unions of orthogo- 
at a given cell c can be computed in 0(n^ ® log nlog^ TV) time. 



4.2 Based Monotone Regions 

A based monotone region is a grid region Rin T such that the intersection of R 
with each column of T is either empty or a continuous subcolumn whose lowest 
edge lies on the lower boundary of T ; That is, it is the connected lower half 
region given by a grid curve in T. The pyramid construction problem for the 
based monotone has been considered in [5]. 

The family of higher-dimensional based monotone regions containing a given 
fixed cell c is defined as follows. We fix a cell c G T in the d-dimensional voxel 
grid, and consider the graph G(c) (defined in Section 3.1). For each vertex p = 
{pi,P2, ■ ■ ■ ,Pd) c, we find the largest index i such that pi Ci, and select 
the outgoing edge of p corresponding to its i-th coordinate. Thus, we obtain a 
spanning tree To(c) of G(c), which we call lexicographic minimum bend spanning 
tree. In the voxel grid, the path from p to c in To(c) consists of at most d 
segments. 

It is easy to see that a region i? is a based monotone region containing 
c = (ci, 1) in a two-dimensional pixel grid if and only if it is a domination closure 
in Tq(c). This gives a d-dimensional analogue of the family of based monotone 
regions. The following theorem is obvious from Theorem 3. 

Theorem 4. The optimal pyramid with respect to the family of domination clo- 
sures ofTo(c) can be computed in 0{nlog N) time. 

4.3 Digitized Star-Shaped Regions 

In Euclidean geometry spaces (i.e., not the voxel grid settings), a popular closed 
family of regions is the family of star-shaped regions centered at a given point 
q. Recall that a region R is said to be star-shaped centered at q if for any point 
V € R, the line segment gU is entirely contained in R. The family of star-shaped 
regions is the closure with respect to the union and intersection of the family of 
convex regions containing q. 

However, its digital analogue is not a well-defined problem in general, since it 
depends on the definition of the digital line segment between two voxels. Unfortu- 
nately, it does not form a closed family if we adopt one of the popular definitions 
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of digital line segments, and analogously define the star-shape polygons in the 
digital geometry. 

Here, we give a probabilistic construction of the digital star-shaped regions 
by using G(c). Let c be the center of the digital star-shaped regions. The first 
idea is to construct a spanning tree T of G(c) rooted at c such that for each 
vertex v (i.e., a voxel of F) of T, the unique directed path pathriv, c) in T from 
w to c simulates a (true) line segment Uc in the digital space. 

Each vertex of G(c) has at most d outgoing edges. We obtain a spanning 
tree of G(c) by selecting exactly one outgoing edge for each vertex (except c). 
Consider a vertex corresponding to the voxel p = (pi,p2, ■ ■ ■ ,Pd)- Let dist{p) = 
Si=i \Pi ~ Ci| be the Lx distance from p to c. Note that dist{p) is also the 
distance from p to c in T. Our strategy for constructing T is to select the edge 
corresponding to the z-th coordinate with the probability \pi — Ci\/dist{p). 

Let Xi{k,p) be the number of edges corresponding to the z-th coordinate 
encountered during walking on the path pathT{p,c) from p hy k steps. Since T 
is constructed in a randomized fashion, Xi{k,p) is a random variable. 

Lemma 6. The expectation E = E{xi{k,p)) is k\pi — Ci\/dist{p). Moreover, 
the probability that \xi{k,p) — E\> x/2ak is smaller than 2e~“ for any positive 
number a, where e is the natural logarithm base. 

Proof. The process is Martingale, and the expectation can be obtained by an 
induction on k. The deviation bound comes from Azuma’s inequality [8]. 

This implies that pathrip, c) approximates the line segment pc for each ver- 
tex p. However, the set of T-domination closures thus defined has a serious defect 
in simulating the star-shaped regions, even in the two dimensional case: Geo- 
metrically, a T-domination closure is not always a simply connected region in T. 
This is because there are some leaf vertices in T that do not correspond to any 
boundary cells (i.e., cells touching the grid boundary) of T. 

For this purpose, for any leaf vertex z; of T corresponding to an internal cell 
of r, we need to add an incoming edge to v in order to obtain a new graph T. 
Again, we choose one of the incoming edges for z; in a probabilistic fashion. A 
simple calculation shows that the expected number of additional edges needed 
is zz/6 -I- o(zz) if d = 2 and bounded by zz/3 for d > 3. We then have a graph T 
as constructed above, and treat the set of domination closures in the graph T 
as a family of digitized star-shaped regions with the center c. See Fig. 5 for an 
example, and compare it with Fig. 3. 

Corollary 2. The optimal pyramid corresponding to T can be computed in 
0{n^-^logn\o^ N) time. 



4.4 Reduction of the Voxel Size 

Although our algorithms are polynomial in the voxel size n, n may become large 
and it is worried that the algorithm it too expensive to use in practice. For 
example, if m = 100 and d = A, n = 100,000,000. In such a case, we need to 
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Fig. 5. An optimal region and its corresponding cut set in T (only a part near c is 
displayed) 



preprocess the original input functions / and /r to reduce the input size for the 
algorithm; precisely speaking, to reduce the size of the graph G(c) considered in 
the algorithm. For the purpose, we unite neighbor voxels with similar feature to 
make a large cell, and contract the corresponding vertices of G(c) into one vertex 
to reduce the size of the graph. A typical method is the quad-tree method, in 
which we stop refining an intermediate cell if each voxel c in the cell has the 
values /(c) and /r(c) that is near (i.e., difference is below a fixed threshold) to 
the average of those values over the cell. Although this does not improve the 
worst-case time complexity, we can replace n in the time complexities by n' 
that is the size of the reduced representation of the inputs by spending 0(n) 
preprocessing time. 



5 Concluding Remarks 

One drawback of our method is that the pixel structure is essential, and the 
algorithms does not work for a terrain given by using a polyhedral representation 
as it is. However, we can modify our algorithm to work for a locally constant 
function on a triangulation if we define a suitable directed graph (based on 
the dual graph of the triangulation) corresponding to the grid graph such that 
each domination-closure of the graph has nice geometric shape. A more general 
problem is approximation by a terrain with k peaks instead of a unimodal terrain; 
in other word, it gives a simplification of a Morse complex of a given input terrain. 
This general problem is a challenging problem. 
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Abstract. The largest common point set problem (LCP) is, given two 
point set P and Q in d-dimensional Euclidean space, to find a subset of P 
with the maximum cardinality that is congruent to some subset of Q. We 
consider a special case of LCP in which the size of the largest common 
point set is at least (|P| -I- \Q\ — k)/2. We develop efficient algorithms for 
this special case of LCP and a related problem. In particular, we present 
an -I- fcn^logn) time algorithm for LCP in two dimensions, 

which is much better for small k than an existing 0(n®'^ log n) time 
algorithm, where n = max{|P|, |Q|}. 



1 Introduction 

Point set matching is one of well-studied problems in computational geometry 
and pattern recognition. Though approximate matching (in the sense of allowing 
errors in distances) is more important [4,7, 11], several studies have been done 
on exact matching. The congruence problem, which asks whether two input point 
sets P and Q {\P\ = \Q\ = n) in d dimensions are congruent, is one of the funda- 
mental exact matching problems. This problem can be solved in O(nlogn) time 
in two [10, 13] and three [6] dimensions. For higher dimensions, Alt et al. devel- 
oped an logn) time algorithm [4], which was improved to randomized 

time by Akutsu [2], and was further improved to randomized 
(^(j^d/4-i-0(i)) ] 3 y Matousek (private communication, see also [2]). 

The largest common point set problem (TCP) is also a fundamental exact 
matching problem [3]. This is a problem of, given two point sets P and Q in 
d-dimensional Euclidean space where |P| = m, |Q| = n and m < n, finding a 
subset of P with the maximum cardinality that is congruent to some subset of 
Q. Akutsu, Tamaki and Tokuyama developed an 0{{\{n,m) + n^)logn) time 
algorithm in two dimensions, where A(n, m) is a combinatorial number denoting 
the inner product of distance-multiplicity vectors of two point sets [3]. They 
gave a non-trivial upper bound of for X{n,m). They also gave 

algorithms in higher dimensions. 

It is worthy to notice a large gap of time complexity between the congruence 
problem and LCP: O(nlogn) time vs. 0((m^'^^n^'^^-|-n^) logn) time. Therefore, 
it is reasonable to ask whether this gap can be lessened if P and Q are very similar 
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Table 1. Comparison of time complexities for LCP with fc- differences. It is assumed 
that the sizes of P, Q and the largest common point set are 0{n), and log‘^*'^^(n) 
factors are ignored 
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this paper (deterministic) 
[3] (deterministic) 


+ n 
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this paper (randomized) 
[3] (randomized) 


+ n 

nf (trivial) 






+ n2-5 



(i.e., the largest common point set is close to P and Q). Precisely, we consider a 
special case of LCP in which at most k points are deleted from P U Q to obtain 
the largest common point set. In other words, we consider a special case of LCP 
in which the size of the largest common point set is at least {m+n—k)/2. We call 
this special case LCP with k- differences. We develop an 0{k^n'^/^ + kn^logn) 
time algorithm for this special case, which is much better for small k than the 
+ n^)logn) time algorithm. We also develop an 0{kn‘^^^logn + 
log^ n) time randomized algorithm which is also better for small k than a 
previous + logn) time randomized algorithm [3], where K 

denotes the size of the largest common point set. We also developed algorithms 
in one and three dimensions. The results on LCP are summarized in Table 1. 
These show that it is possible to lessen a complexity gap between the congruence 
problem and LCP if P and Q are very similar. 

Though it is not so common to consider fc-differences problems in point set 
matching, a lot of studies have been done for string matching with k-differences 
[9,12]. In particular, Landau and Vishkin developed an 0{kn) time algorithm 
for string matching with fc-differences [12], where n denotes the length of a text 
string. One of the main ideas in their algorithm is the use of a suffix tree for 
finding a maximally matching substring pair in constant time. We also employ 
this idea to identify a maximally matching segment pair in constant time, which 
allows us to break the barrier of A(n, m) factor in the time complexity. Though 
exact string matching was applied to the congruence problem in two dimensions 
[10,13], simple replacement of exact string matching with the Landau-Vishkin 
algorithm does not lead to an algorithm for LCP. 

In this paper, we show another and more practical application of the ideas 
in the Landau-Vishkin algorithm to a geometric problem. We consider the k- 
differences problem for numerical sequences. Many scientific data are represented 
as line plot graphs, where each line plot corresponds to a sequence of numerical 
data. For example, time series data of gene expression levels are considered to 
be a set of line plot graphs [18], where each line plot corresponds to time series 
of one gene. It is important to compare time series data because genes may 
have similar functions if their time series data are similar [18]. The fc-differences 
problem for numerical sequences can be trivially solved in 0{mn) time, where m 
denotes the length of pattern sequence and n denotes the length of text sequence. 
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Combining the ideas in [1], [5] and [12], we develop an 0{k^/^rn^/^n\ogn) time 
algorithm, which is better than the 0{mn) time algorithm for small k. 

In this paper, we adopt a random access machine (RAM) as a model of com- 
putation. As usual, we assume that the machine can store each O(logn) bit 
integer in a word and can perform each of arithmetic and logical operations in 
constant time. Furthermore, we assume that for handling coordinates of points, 
the machine can represent arbitrary real numbers and can exactly perform all 
the geometric computations involved (e.g., determining angles, distances, etc.) 
without round-off-errors. However, the latter can be weakened without increas- 
ing the orders of the time complexities so that each of addition, subtraction, 
multiplication and division on rational numbers can be performed in constant 
time with finite precision. Though we do not consider mirror images, all the 
results are valid even if rigid motions with mirror images are considered. 

2 One-Dimensional Case 

Before describing algorithms, we formally define LCP with fc-differences (for 
any fixed dimension d). Let P = {Pi,P 2 , ■ ■ ■ ,Pm} Q = {9d 92 ) ■ • ■ ) be 
point sets in d-dimensional Euclidean space, respectively. Let k he a, positive 
integer. Then, the largest common point set problem with fc-differences (which 
is abbreviated as LCP-ZcDIFF) is defined as follows. 

LCP-/cDIFF (Largest common point set with fc-differences) 

Given P, Q and k, find a rigid motion T such that |T(P) H Qj is the maximum 
under the condition that \P{P) HQ] > {m + n — k)/2 and output the size of 
|T(P) n Q|. If such T does not exist, output “not found”. 

It should be noted that the number of differences (i.e., the number of points not 
contributing to the largest common point set) is k' if and only if |T(P) n Q| is 
{m + n — k')/2. We use LCP also to denote the largest common point set. 

LCP in one dimension can be solved in O {rn^ri log n) time by using the fol- 
lowing naive algorithm. For each pair (Pig,qj^), we consider the uniquely deter- 
mined rigid motion T such that T{pj^J = Then, we compute T(P) n Q by 
examining for each p^ whether there exists qj such that P{Pi) = qj using binary 
search. Clearly, P{P) H Q can be computed in 0(m log n) time, from which the 
total 0{rn?n\ogn) computation time follows. 

Here we assume w.l.o.g. (without loss of generality) that P and Q are sorted 
in increasing order (i.e., Pi < P 2 < ■ ■ ■ < Pmi 9i < 92 < ■ • ■ < 9«)- For each 
pair (p^,qj), we define mmsp{pj^, qj) (maximally matching segment pair) to be 
the pair of {p^,Pi+i,Pi+ 2 , ■ ■ ■ ,Pi+h-i) and (q^, 9 ^+ 2 . • • ■,qj+h-i) with the 

maximum h such that p^_^.l^, — Pi = qj+h' ~ Qj holds for all h' < h (see also 
Fig. 1). We denote such h by \mmsp{Pi,qj)\. 

Lemma 1. There exists a data structure with which \mmsp{Pi, qj)\ can be an- 
swered in 0(1) time for given {i,j)- Moreover, the data structure can he con- 
structed in 0(n log n) time. 
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Fig. 1. Example of LCP-fcDIFF in one dimension, where k = 5. Matching point pairs 
(i.e., points in LCP) are connected by dashed lines. Shaded region shows mmsp{p 2 , q^) 



Proof. We use a technique developed in [12]. We define an alphabet S by 

^ = {p*+i -Pi\i = 1} U {Qj+i - qj\j = l,...,n- 1}, 

where identical vectors are regarded as the same element. We construct a sujjix 
tree for string strp^Q defined by 

Strp^Q = {P2-Pl,P3~P2, ■ ■ ■ ,Pm-Pm-l,#,Q2-Ql,Q3-Q2, ■ ■ ■ ,Qn-Qu-l,^), 

where #,$ ^ S. 0(n log n) time is sufficient for construction of E and the suffix 
tree [9,14]. Then, we can get \mmsp{Pi,qj)\ in 0(1) time by computing LCA 
(lowest common ancestor) between the leaf corresponding to the suffix starting 
Pi+i ~ Pi and the leaf corresponding to the suffix starting at qj+i — qj, with 
0(n) preprocessing time [15]. □ 

The procedure LCPKip{P, Q, k) outlined below computes the minimum dif- 
ference kmin, which corresponds to LCP with kmin differences. 

Procedure LCPKio{P,Q, k) 

Construct a suffix tree for strp^Q; 

kmin ^ k 1 ^ 

for zo = 1 to fc -I- 1 do 
for Jo = 1 to /c -h 1 do 

k' ^io+ jo - 2; zo; j ^ jo; 

Let T be a rigid motion such that T{p^^) = q^^; 
while i < m and j < n and fc' < fc do 

h ^ \mmsp{pi,qj)\; 

Find the minimum i' such that i' > i + h and (3j')(T(pj/) = q /); 

- (# 1 ) 

(If such z' does not exist within total error fc, 
then exit while loop and update fc' appropriately) 
k' ^k' + {i' -i-h) + {j' -j -h); i^i'; j ^ j'; 

if fc ^ kmin then kmin ^ fc 5 
if kmin E k then output kmin else output “not found” 

This procedure is similar to the Landau-Vishkin algorithm [12]. However, 
we do not use dynamic programming because — p^ should be computed if 
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Pi^i, do not contribute to LCP. Instead, we can use the rigid motion 

(uniquely determined by {Pi^,qjg)) to locate the next matching pair. 

Theorem 1. LCP-kDIFF in one dimension can he solved in 0{k^ + nlogn) 
time. 

Proof. The correctness of LCPKid{P, Q, k) is obvious because at most k points 
are deleted from PUQ and we need to examine at most {k + 1)^ rigid motions. 

Here we consider the time complexity. The while loop is repeated at most 
k+1 times per (to, jo)- Within the while loop, 0(1) time is sufficient except for 
the part of (#1). (#1) is computed as in the merge phase of the merge sort (i.e., 
i is incremented if P{Pi) < Qj, otherwise j is incremented). Since i and j are 
incremented at most 2k times in total, the total time required for (#1) is 0{k) 
per (to, jo)- Therefore, 0(fc) time is sufficient per (to, jo)- Since O(k^) pairs of 
(to, jo) are examined, the total time is 0{k^ + nlogn). □ 

In LCPKi]j{P,Q,k), we examined k+1 points from P. However, if we 
randomly sample a constant number of points P' from P (and corresponding 
0{k) points from Q), the probability that P' contains at least one point in LCP 
is high if k is not too large (e.g., k < mjc). The probability of error can be made 
small (e.g., less than l/n° ) if we repeat the procedure O(logn) times. 

Corollary 1. Suppose that c is an arbitrary fixed positive constant and k < mlc 
holds. Then, LCP-kDIFF in one dimension can he solved in 0{{k^ + n)logn) 
time with high probability. 

Though we considered a kind of global matching in the above, we can also 
consider local matching as in approximate string matching. In this case, we only 
count the errors (the number of points in P{P) UQ not included in LCP) in the 
range of [T(p^), T(p^)]. We call this problem LCP-fcDIFF-LOCAL. 

Corollary 2. LCP-kDIFF-LOCAL in one dimension can he solved in 0{k^n + 
nlogn) time deterministically, and in O(fcnlogn) time by a randomized algo- 
rithm with high probability assuming k < mlc. 

3 Two-Dimensional Case 

To solve LCP-/cDIFF in two dimensions, we reduce the two-dimensional case to 
a problem similar to LCP-fcDIFF-LOCAL. 

Suppose (until Lemma 2) that we have two point sets P' = P \J { pq } and 
Q' = Q U {go}, where Pq ^ P must matches Qq ^ Q in a rigid motion. We sort 
the points of P in the following way. Let 0{pf) (0 < 9{pf) < 27t) to be the angle 
of Pj — Po with respect to p^ — Po (i-e., 9{pf) = Lp^p^pf). Let |£c| denote the 
length of a vector x. Then, we define the total order ^ on P by 

Pi < Pi' ^ ^{Pi) < S{Pe) or (6»(Pi) = 9{pi,) and |Pi - p, J < \Pi, -PiJ)- 
The points of Q are sorted in the same way (see also Fig. 2). 
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Fig. 2 . Example of LCP-fcDIFF in two dimensions. Points are sorted according to . 
Points not included in LCP are shaded 



Now we can assume w.l.o.g. that A P 2 ^ ^ Pm^ -< ^2 ^ ^ 

q„. We define an equivalence relation ~ on {Pi+i,Pi) and {qj+i,qi) by 

{P^+l,P^) ~ {qj+i,qi) ^ \Pi+i-Po\ = l9i+i - 9ol and \Pi - Po\ = \qj ~ Qol 

and ^p,+lPoP^ = ^qj+iqoQj, 

where this relation is extended for {Pi+i,Pi)’s (resp. {qj+i,qj)'s). As in the one- 
dimensional case, we define an alphabet S' by S' = U 

{{qi+i.qi)\i = where = p^, q„+i = q^, and equivalent elements 

are regarded as the same element. 

Here, we construct a sequence PP = {p'l, . . . ,P 2 m-i) where Pi = Pi < m, 
otherwise p' = Pi_^- We also construct a sequence QQ = (q^, . . . , q2„_i) in an 
analogous way. Then, we construct a suffix tree for strpp^QQ over S' defined by 

Strpp^QQ = ( (P2.P1). (P3.P2). • ■ • . (P2m-l.P2m-2). #. 

( 92 ) 9 l). (93. 92). ■ • ■ . ( 92 n-l.P 2 n- 2 ). ^ )■ 

Clearly, all the constructions can be done in 0(n log n) time. 

For each rigid motion T satisfying 'P(pg) = q^ and T(p') = q' where i < 
fed- 1, we compute a common point set T({p', . . . ,p'_,_^_i}) C {q' , . . . , q'_,_„_i} 
as in the one-dimensional case. Since only fc-differences are allowed, it takes 0{k) 
time per T and thus 0{kn) time in total. 

Lemma 2. Let Pi and q^ are arbitrary points chosen from P and Q, respec- 
tively. Suppose that only rigid motions T satisfying P{Pi) = q^ are allowed. 
Then, LCP-kDIFF in two dimensions can he solved in 0{k‘^n -\- nlogn) time. 

In order to solve LCP-fcDIFF, it is sufficient to examine 0{kn) pairs of 
(Pq, qg). Precisely, we select fc -I- 1 points from P and all points from Q because 
at least one point in the selected fc -|- 1 points contributes to LCP. Therefore, it 
takes 0{k^n^ kn^ logn) time. 

However, this time can be reduced using the fact that there exists 0(n^/^) 
point pairs which have the same distance in two dimensions [16, 17]. Therefore, at 
most 0(fc^n^/^) rigid motions are examined and thus 0(fc^n^/^) time is required 
in total along with O(fcn^logn) time for pre-processing. Enumeration of such 
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motions can be done in O(k^ri^^^logn) time with O(n^logn) pre-processing 
time. Since k^n^^^logn is not greater than 0{k^n^/^ + fcn^logn), we have: 

Theorem 2. LCP-kDIFF in two dimensions can be solved in 0{k^n'^/^ + 
kn^ log n) time. 

We can also develop a randomized algorithm in two dimensions. In this case, 
we only sample a constant number of pairs (pg , Pj) . If fc < m/c holds for arbitrary 
fixed constant c, it is expected that both points in the pair are included in LCP 
with no less than some constant probability. For each pair, we enumerate pairs 
(qq, Qj) satisfying \p^ — pg| = \qj — qg|. Enumeration of such pairs can be done 
in 0(n^/^ log n) time with O(n^logn) pre-processing time. Then, we examine 
each rigid motion T defined by T(pg) = and T(pj) = q^. Since rigid 

motions are examined and 0{n^ logn) time is required for constructing E's and 
suffix trees in total, this randomized algorithm takes 0{kn'^^^+n^ log n) time. We 
repeat this algorithm 0(log n) times in order to increase the success probability. 

Corollary 3. Suppose that c is an arbitrary fixed positive constant and k < mic 
holds. Then, LCP-kDIFF in two dimensions can be solved in 0{kn'^^^ logn -\- 
log^ n) time with high probability. 

4 Three-Dimensional Case 

The algorithms for the two-dimensional case can be modified for three dimen- 
sions. We describe the required modifications in this section. 

In order to reduce the three-dimensional problem to a string matching like 
problem, we use two point pairs , q ^^ ) and (pj^ , qj^ ) and consider rotations 
around pffpff and (see Fig. 3). We can define the total order ^ on P by 

using another point Pj^ and considering the following for each Pj € P: 

— the angle between the planes defined by Pj^ , Pj^ , Pi and by Pj^ , Pj^ , Pig , 

“ {Pi ~ Pii) ■ {Pi 2 ~ Pii) where x ■ y denote the inner product, 

— distance between p^ and Pifpff. 

The total order on Q is defined in an analogous way. Once the total orderings 
are defined, we can use the following procedure. 

(i) for 0(fc2) pairs of (Pi,,p,J, for pairs of {qj,,qjJ such that \Pi^ - PiJ = 
\^j 2 - 9iil. execute (ii)-(hi), 

(ii) sort P, Q and construct a suffix tree, 

(iii) for 0{k) points of p,g , for q^^ and T satisfying T (Ap^^p^^Pi J= qj^ q ^^ , 
compute T(P) H Q under the condition that at most fc-differences are al- 
lowed. 

Theorem 3. LCP-kDIFF in three dimensions can be solved in 0{k‘^n^'^\ogn-\- 

^2^5/2 ^Qg2 
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Fig. 3. LCP-fcDIFF in three dimensions. For each rigid motion T such that 

T(P) HQ is computed 

Proof. Since the correctness is obvious, we consider the time complexity. 

For each there are 0(n^/^logn) (precisely, (log* 

pairs of (qj^,qj^) satisfying the required condition [8]. Since O(nlogn) time is 
enough per execution, step (ii) takes log^ n) time in total. 

The number of congruent triangles in three dimensions is 0(n^ ® logn) (pre- 
cisely, 0(n^'®(log* [3]. Thus, 0(/c^n^ ® log n) triangles are examined and 

step (iii) takes 0(/c^n^ ® log n -I- A:^n^ ®log^n) time in total. Since fc^n^ ®log^n is 
no greater than 0{k^n^-^ logn -I- log^ n), the theorem holds. □ 

Corollary 4. Suppose that c is an arbitrary fixed positive constant and k < mlc 
holds. Then, LCP-kDIFF in three dimensions can he solved in 0{kn^'^ log^ n -I- 
7 j 5/2 iog3 probability (using random sampling o/ j. 

5 Approximate Matching of Numerical Sequences 

Let P = {pi,. . . ,Pm) and Q = (gi, . . . , q„) be sequences of real numbers. We 
say that Pi matches qj if \pi — Qj\ < e, where e is a real number given as a 
part of an input. Then, the k-mismatches problem for numerical sequences is to 
enumerate all positions j of Q such that \pi — qj+i-i \ < e holds for at least m — k 
indices i. The k-differences problem for numerical sequences is to enumerate 
all positions j such that \pi, — < e holds for all Z = l,...,/i, where 

1 < Zi < < . . . < A < m and 1 < Ji < J 2 < • ■ • < jh hold, and the 

number of errors (mismatches, deletions and insertions) jh + m — 2h is at most k 
(see also Fig. 4). It is easy to see that the fc-differences problem (and also the k- 
mismatches problem) can be solved in 0{mn) time using dynamic programming. 
Here, we describe an o{mn) time algorithm for small k. 

The proposition below directly follows from the result on the smaller match- 
ing problem [5], which is to enumerate all positions j of Q such that pi < qj+i-i 
holds for alH = 1, . . . , m. 

Proposition 1. The k-mismatches problem for numerical sequences can he 
solved in 0{^/mnlogm) time. 

In order to develop an algorithm for the Zc-differences problem, we modify the 
Landau-Vishkin algorithm. One of the main ideas in Landau-Vishkin algorithm 
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Fig. 4. Approximate matching of numerical sequences. In this case, we have a match 
{(pii 93)) (P2, <74), (P4, <75)} with 1 difference (ps is considered to be deleted from P) 



is fast computation of \mmsp{pi,qj)\, where mmsp{pi,qj) is defined here to be 
a pair of (pj,p*+i, . . . and {qj,pj+i, . . . ,pj+h-i) with the maximum h 

such that Pi+h' matches qj+h' for all h' < h. If we could use a suffix tree, 
|mmsp(pi, ( 7 j)| would be computed in 0(1) time. However, we can not use a 
suffix tree here because pi can match multiple g^’s (having non-identical values). 
Therefore, we employ a technique introduced in [1] so as to efficiently (but less 
efficiently than using suffix trees) compute \mmsp{pi,qj)\. 

Let M {M < to) be an integer whose exact value is to be determined later. 
We divide P into m/M blocks Pi, P 2 , • ■ • , Pm/M: where each Pi has length M 
and we assume w.l.o.g. that m/M is an integer. For each of Pt, we compute a 
table IF[t, j], where IF[t, j] denotes the maximum number of blocks s such that 
Pf , . . . , Pt+s-i matches qj, . . . , qj+Ms-i- Once this table is constructed, we 
can compute \mmsp{pi,qj)\ in 0{M) time: we check each pair {pi+i,qj+i) one 
by one until i + I mod M = 1, and then we look up W\{i + I — 1/M) -|- 1, j -I- 1] 
(let w be that value) and jump to the pair (j>i^i^wM,qj+i+wM), after which we 
need to check at most M pairs. Since \mmsp{pi,qj) \ is computed in 0{M) time, 
we can modify the Landau- Vishkin algorithm so that it works in 0{kMn) time. 

Table TT[t, j] is constructed as follows. For each block Pi, we solve 1-mismatch 
problem for numerical sequences using Proposition 1 and enumerate all positions 
j of Q such that Pi matches qj,qj+i , . . . , qj+M-i- W[t,j] is initially set to 1 if Pi 
matches with . . . , otherwise W[t,j] is set to 0. After this, W[t,j] 

is replaced by the maximum number w such that lF[t, j] = W\t -|- 1, j -I- M] = 
. . . = W[t + w — l,j + {w — 1)M] . It is not difficult to see that table construction 
can be done in 0(^y/Mlogm) time. 

Solving kMn = ^y/M, we have M = (^)^^^. In practice, it is enough to 
set M an integer nearest to M = (^)^/^. 

Theorem 4. The k-differences problem for numerical sequences can he solved 
in 0{k^^^mf/^n\ogm) time. 
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Abstract. Given a convex polygon P with n vertices, we present al- 
gorithms to determine approximations of the largest axially symmetric 
convex polygon S contained in P, and the smallest such polygon S' 
that contains P. More precisely, for any e > 0, we can find an axially 
symmetric convex polygon Q G P with area \Q\ > (1 — e)|S'| in time 
0{n -I- 1/e®^^), and we can hnd an axially symmetric convex polygon Q' 
containing P with area IQ' I < (l-|-e)|S'| in time 0(n -I- (1/e^) log(l/e)). 
If the vertices of P are given in a sorted array, we can obtain the same re- 
sults in time 0((l/y'e) log n-l-l/e®^^) and 0((l/e) logn-l-(l/e^) log(l/e)) 
respectively. 



1 Introduction 

A polygon is axially symmetric if it is symmetric with respect to a line. In 
this paper we present algorithms to determine, for a given convex polygon P, 
approximations of the largest axially symmetric convex polygon contained in P, 
and the smallest such polygon that contains P. 
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There are a number of papers that study the best inner approximation of any 
convex set by a symmetric set; the distance to a symmetric set can be considered 
a measure of its symmetry [7]. Lower bounds for this distance are given by the 
Lowner-John ellipsoid [8]: any planar convex body C lies between two homothetic 
ellipses E C C C 2E with homothety ratio at most 2. Since any ellipse is axially 
symmetric, and area(if) = | area(2_E) > |area(C), any convex planar set C 
contains an axially symmetric subset with at least 1/4 of the area of C. The 
same lower bound of 1 /4 follows from the fact that any planar convex body lies 
between two homothetic rectangles with homothety ratio at most two [10,13]. 
The lower bound can be raised to 2/3 [9], but that is not known to be tight. 

The largest centrally symmetric set contained in a convex shape C is the 
maximum intersection of C and a translate of —C. If C is a convex n-gon, this 
can be computed in 0(n log n) time [5] . Approximation by axially symmetric sets 
is technically more demanding, as the largest axially symmetric set contained 
in C is the maximum intersection of C and a rotated and translated copy of 
C (with C a fixed axially refiected copy of C). We do not know of any exact 
algorithm to compute the maximum intersection of two convex polygons under 
translation and rotation (orientation-preserving rigid motions), indeed it is not 
clear that such an algorithm can exist within a reasonable model of computation. 

A recent manuscript [4] proposes a practical algorithm to compute exactly 
the largest subset of a (not necessarily convex) n-gon with an axial symmetry; 
but it requires to solve 9{n^) intractable optimization problems. This is our 
motivation to give a fast approximation algorithm for the problem. We can find 
an e-approximation to that set in time 0(n-|- 1/e^/^). We also obtain a sublinear 
time algorithm when the vertices of P are given in a sorted array. 



Theorem 1. Let P he a convex polygon in the plane with n vertices. Given 
e > Q, we can find a set Q C P with axial symmetry and 

area(Q) > (1 — e) max|area(Q*) Q* C P and Q* axially symmetric | 

in time 0{n + 1/e^/^). If the vertices of P are given in a sorted array, we can 
find Q in time 0{{l/^/e) logn -I- 1/e^/^). 



The problem of outer approximation of a convex polygon by an axially sym- 
metric polygon seems to have received less interest than inner approximation, 
perhaps because this is equivalent to the inner approximation problem if one 
drops the requirement that the axially symmetric polygon has to be convex. 
The results on approximation by homothetic pairs (ellipses or rectangles) cited 
above give again simple bounds: for each convex set C there is an axially sym- 
metric set D containing C with area(Zl) < 4area(C). The constant 4 can be 
reduced to 31/16 [9], again this is probably not tight. We give a fast approxi- 
mation algorithm for this problem (the proof will be given in the full version of 
this paper). 

Theorem 2. Let P he a convex polygon in the plane with n vertices. Given 
s > 0, we can find a convex set Q D P with axial symmetry and 



area(Q) < (1-l-e) min|area(Q*) 



Q* P and Q* convex and axially symmetric^ 
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in time 0{n + (1/e^) log(l/£)). If the vertices of P are given in a sorted array, 
we can find Q in time 0{{l/e) logn + (1/e^) log(l/e)). 

Our algorithms are based on three key ideas. 

— First, we approximate the input polygon by a convex polygon whose size 
depends only on e. In Theorem 1, we use Dudley’s constructive proof [6]. A 
recent paper by Agarwal et al. [1] uses this idea as well. This construction 
does not seem to help in Theorem 2, so we use an approximation in a stronger 
sense, but with larger size. 

— Second, we discretize the set of directions and sample only directions in a 
discrete set. This works well as long as the polygon is not long and skinny. 
Fortunately we can show that for long and skinny polygons, the axis of an 
optimal symmetry must be very close to the diameter of the polygon, or 
must be nearly orthogonal to this diameter. 

— Finally, we use an algorithm to compute the optimal solution for a given 
direction of the axis of symmetry. In Theorem 1, this is equivalent to finding 
the translation of C" that maximizes the area of CnC'. As mentioned before, 
this can be done in time O(nlogn) [5]. In our case, it suffices to consider a 
one-dimensional set of translations, which permits a linear time solution [3]. 
This solution makes use of the area of cross-sections of a three-dimensional 
polytope and the Brunn-Minkowski Theorem. We do not know of a similarly 
efficient solution for the case of Theorem 2, so we will give a plane sweep 
algorithm. 

2 Approximating a Convex Polygon 

A key component of our proofs is an approximation to a given polygon whose size 
depends only on e. We will employ two different methods. For the circumscribed 
case, we use a simple sampling approach (see Lemma 1). For the inscribed case 
we use Dudley’s method [6], which gives an approximation with fewer vertices, 
but in a weaker sense (see Lemma 2). This partly explains why the running time 
of our algorithm for the inscribed case has a lower dependency on 1/e. 

Here and in the rest of this paper, |P| denotes the area of a polygon P. For 
two sets A and B such that A C B, the Hausdorff distance between A and B is 

max( min I la — 611 ) 
fees VaGA " '7 

where ||a — 6|| is the Euclidean distance between a and b. 

Lemma 1. Given a convex n-gon P and £ > 0, one can construct in time 0{n) 
a convex polygon P^ C P with 0{l/s) vertices such that the Hausdorff distance 
between P and P^ is less than £|P|/30diam(P). If the vertices of P are given 
in a sorted array, we can find P^ in time 0{{l/e)logn). 

Proof. Given the convex n-gon P, we first determine a pair {a, 6} of diametral 
vertices of P in 0{n) time ([12] p. 176), and then find the width of P orthogonal 
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to ab, also in 0(n) time. This results in a rectangle R, one side of which is 
parallel to ab and has length diam(P), and P touches all four sides of R. We use 
an orthonormal basis where the a;-axis is parallel to ab. Intuitively, we will select 
0(l/e) vertices of P that are evenly spaced with respect to their y-coordinate. 

The boundary of P consists of two y-monotone chains Ci and Cr- We select 
vertices of these chains to form the set of vertices of P^. First we select the 
lowest vertex (that is, the vertex with smallest y-coordinate). Then we traverse 
Cl (resp. Cr) upward. After selecting any vertex v, we select the next vertex v' 
as follows: 

— if the vertex v" that immediately follows v along C/ (resp. Cr) is such that 
y(v") — y(v) > e|P|/30diam(P) then select v' = v" . 

— otherwise v' is the highest vertex of C/ (resp. Cr) such that y(v') — y{v) < 
e|P|/30diam(P). 

This ensure that any two consecutive vertices of P^ either are consecutive along 
P, or their vertical distance is at most £|P|/30 diam(P). So the Hausdorff dis- 
tance between P and P^ is less than £|P|/30 diam(P). 

The area of R is at most 2|P|, so the width of R is at most 2|P|/ diam(P). 
Also note that, after two steps of the above construction, the y-coordinate has 
increased by at least £|P|/30 diam(P). Therefore P^ has 0(l/£) vertices. Clearly, 
this construction allows us to compute P^ in 0{n) time. The sublinear time 
algorithm when the vertices of P are sorted in an array will be given in the full 
version of this paper. □ 

Lemma 2. Given a convex n-gon P and £ > 0, one can construct in time 0{n) 
two convex polygons Pe and P^ with 0(1/ y/e) vertices, such that Ps C P C Pg, 
and |Pg \ Pg| < \e\P\. If the vertices of P are given in a sorted array, we can 
find Pg in time 0{{l/^/e)logn). 

Proof. As in the proof of Lemma 1, we first find in time 0(n) a rectangle R, one 
side of which is parallel to a diametral pair ab, and P touches all four sides of R. 
Since the claim of the lemma is invariant under affine transformations we can 
apply a transformation that maps R to the unit square; thus in the following we 
will assume that P is inscribed in the unit square, and touches all its four sides. 

We go once around P, starting at an arbitrary vertex, and select a subset of 
the edges of the polygon. We always choose the first edge. Let now e = uu' be 
the most recently chosen edge, let e' = vv' be the next candidate edge, and let 
e" = v'v" be the edge following e'. We choose e' if 

— the distance d{u',v') > ^/e, or 

— the outer normals of e and e" make an angle larger than 

This process results in a sequence of polygon edges, once around the polygon. 
We observe that the number of edges selected is 0(\j ^/e). The perimeter of the 
polygon P is 0(1) (it is inscribed in the unit square), so only 0(\j ^/e) edges can 
be chosen according to the first rule. The total change of the outer normal angles 
is 27 t = 0(1), so only 0(\j ^fe) edges can be chosen according to the second rule. 
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Let Pe be the convex hull of these selected edges, and be the polygon 
obtained by extending the selected edges until they form a convex polygon. 
Then P^ C P C P^. 

It remains to bound \P^ \ Pe\. The difference consists of 0{l/y/e) triangles 
uu'v, where u and u' are vertices of P, and v is the intersection of the lines 
supporting two consecutive chosen edges. By the first edge selection criterion, 
the distance d{u, u') < \fe. By the second criterion, the lines uv and u'v make 
an angle of at most ^/e, and so the angle Zuvu' > tt — ^/e. Together this implies 
that the area of the triangle is at most It follows that |P£\Pe| = 0{e). The 
area of P, on the other hand, is at least 1/2, since P is a convex set inscribed in 
the unit square, touching all sides and with a diameter parallel to one side. This 
shows that |P/ \ Pe| = 0(e|P|), which proves the lemma. The sublinear time 
algorithm when the vertices of P are sorted in an array will be given in the full 
version of this paper. □ 

3 Discretizing the Rotation 

To find the optimal axis of symmetry, we will test a discrete set of orientations. 
In this section we bound the error incurred by this discretization. We start by 
analyzing the change in a convex set being rotated. (A proof of this lemma will 
be given in the full version of this paper.) 

Lemma 3. Let C he a convex set in the plane, and let C he a copy of C , rotated 
hy an angle S around a point p in C. Then \C n C'\ > \C\ — 7r<5diam(C')^. 

The occurrence of diam(C')^ instead of \C\ is a problem. In the following 
sections, we will need to argue that if the set is long and skinny, that is, when 
diam(P)^ is much larger than |P|, we need to sample only directions almost 
parallel to the diameter. 

In the following we denote by refl(-, f) the reflection at line £, so that refl(C, £) 
is the reflected image of C under reflection at £. 

Lemma 4. Let £ and £' he two lines intersecting in a point p with an angle S, 
and let P he a convex set. Lf p G P n refl(P, ^), then 

|P n refl(P, £')| > |P n refl(P,£)| — 27r<5diam(P)^. 

Proof. The concatenation of the reflection at £ and the reflection at £' is a ro- 
tation around p by the angle 2(5. Let Q := P n refl(P, ^). Since Q is symmetric 
with respect to £, the set refl(Q,P) = reR{reR{Q , £) , £') is a copy of Q rotated 
around p by 2<5. Since p G Q, Lemma 3 implies that 

IQ n refl((5,^')| > IQI — 27r5diam(Q)^ • 

Since Q C P, we have 

P n refl(P, £') L> Qn refl(Q, £') , 
and by the above that implies 

|P n refl(P, ^')| > IQI — 27T(5diam((5)^ > |P H refl(P, £)| — 2t:6 diam(P)^ . 

□ 
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A similar bound holds for conv(P U refl(P, ^)). 

Lemma 5. Let i and £' he two lines intersecting in a point p with an angle S, 
and let P he a convex set. If p G P, then 

I conv(P U refl(P, £'))! < | conv(P U refl(P, £))| + 47 t( 1 + 7r/2)i5diam(P)^ . 

Proof. Let Q := conv(P U refl(P, f)) and Q' := conv(P U refl(P, P)). As in 
Lemma 3, we argue that any point of refl(P, £') has distance at most 2S diam(P) 
from some point of refl(P, £). This implies that Q' is contained in the Minkowski- 
sum of Q with a disk of radius 2(5diam(P). This Minkowski-sum has area 

IQI + 26 diam(P) peri(Q) + 7 t(2i 5 diam(P))^ . 

Since p G P, we have P n refl(P, yf 0, and so diam((5) < 2diam(P). This 
implies peri(Q) < 27rdiam(P), and we obtain 

\Q'\ < IQI + 47t(i 5 + (5^) diam(P)^ < \Q\ + 47t(1 + 71:12)5 diam(P)^. 

□ 

4 The Largest Axially Symmetric Inscribed Set 

Let C be a convex set in the plane and let ^ be a line. For any line £ the set 
C n refl(C', £), if it is not empty, is an axially symmetric convex subset of C, the 
largest axially symmetric subset with reflection axis £. 

Our goal is to And, for a convex polygon P, the line £°p*(P) that maximizes 
the area of this set: 

P n refl(P, ^°P*(P)) = max P n refl(P, £) . 

As we discussed in the introduction, Lassak proved the following lower bound [9]: 

Lemma 6. 

Pnrefl(P,rP‘(P)) > ^|P| 

o 

Theorem 1 claims that at least an £-approximation with 

(1 -£) Pnrefl(P,rP*(P)) < Pnrefl(P,4) 

can be found fast. Our proof of Theorem 1 relies on Lemma 2, Lemma 6 and 
the following two lemmas. 

Lemma 7. Given a convex n-gon P and a line £, one can find in 0(n) time the 
line £' parallel to £ that maximizes P n refl(P, £') . 

Proof. Let Q := refl(P, f), and let t be a vector orthogonal to £. For any line 
£' parallel to £, refl(P, £') is the translation of Q by a multiple of t, and so the 
problem is equivalent to finding the A G K such that |Pn (Q + At)| is maximized. 
A linear-time algorithm to solve this problem has been given by Avis et al. [3] . 

□ 
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We will apply this algorithm to a set of 0(1 /e) direction. The following proof 
shows how to find this set. Intuitively, when P is fat we will just sample the space 
of directions uniformly. When P is long and skinny, we will sample more densely, 
but we will only sample near the two axis of symmetry of a bounding rectangle 
R that is parallel to a diametral segment ab (see Fig. 1). 




Fig. 1. Case where P is long and skinny. We sample directions from the shaded area. 



Lemma 8. Given a convex n-gon P and e > 0, one can construct in time 
0{n + 1/e) a set o/0(l/e) directions such that 



( 1 - ¥) 



Pnrefl(P,rP"(P)) 



< max| I P n refl(P, £) | : i has a direction from Dg | 



Proof. By Lemma 4 and Lemma 6 it is sufficient to choose the set such that 
it contains a line that makes an angle 6 of at most e|P|/67rdiam(P)^ with 

We start by computing, in time 0{n), a diameter ab of P, and the area |P|. 
We then distinguish two cases. 

If diam(P)^ < 20|P|, then we generate by sampling the direction space 
uniformly, choosing multiples of e/GOtt. Since £/1207 t < £|P|/ 67rdiam(P)^, this 
is sufficient. 

If, on the other hand, diam(P)^ > 20|P|, then we sample uniformly the 
directions within 37 t|P |/2 diam(P)^ of the direction of the diameter ab, choosing 
multiples of £|P|/37rdiam(P)^. We do the same around the direction that is 
orthogonal to ab. To show that this is sufficient we have to demonstrate that 
£°P*(P) does not make an angle larger than 37 t|P|/ 2 diam(P)^ with the direction 
of the diameter or with the direction that is orthogonal to the diameter. 

As in the proof of Lemma 2, we can choose a rectangle R parallel to ab 
of length diam(P) such that P touches all four sides of R. This implies that 
the area of R is at most 2|P|, and so its width is at most 2|P|/ diam(P). It 
follows that P lies in an infinite strip of width at most 2|P|/ diam(P). Let 7 S 
[0,7 t] be the angle made by the lines ^°p*(P) and ab. The set refl(P, £°p*) is 
contained in a congruent strip, intersecting the strip of P at an angle 2y. The 
set Pnrefl(P, £°p*^) is contained in the intersection of the strips, which has area at 
most 4|Pp/ diam(P)^| sin 27/ By Lemma 6, we know that |Pnrefl(P, i°^^{P))\ > 
2|P|/3, so the angle 7 must satisfy 



^ m 

diam(P)2| sin27| ~ 3 
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thus |sin27| < 6|P|/ diam(P)^. It means that we are in one of the following 
three cases: 7 < 37 t|P|/ 2 diam(P)^, or 71 — 7 < 37 t|P|/ 2 diam(P)^, or I7 — 7 t/ 2| < 
37 t|P |/2 diam(P)^. □ 

We can now complete the proof of Theorem 1 . Given the convex polygon P 
and £ > 0, we construct the outer approximating polygon P^ of Lemma 2, obtain 
for this polygon a set of 0(l/e) directions from Lemma 8, and determine for each 
of them the optimal line by Lemma 7. 

It takes time 0(n) to construct P', or even 0((I/£) logn) when the vertices 
of P are given in a sorted array. It takes time 0(I/-y/e + 1/e) = 0{l/e) to 
construct Pg, and for each of the 0(I/e) directions it takes time 0{l/y/e) to 
find the optimal line of that direction. Together this is the claimed complexity 
of 0{n+ l/£^/^), or 0((l/-ye) logn + l/e^^^)) when the vertices of P are given 
in a sorted array. 

It remains to show that the line £g with the largest intersection gives the 
approximation claimed by Theorem 1. 

(1 - ie) Pnrefl(P,rP‘(P)) < (1 - ^£) P^ n refl(P;, rP*(P)) 

<(l-i£)P;nrefl(P;,rP*(Pg')) 

< P;nrefi(Pg',4)| 

< Pnrefl(P,4) +2|Pg'\P| 

< Pnrefl(P,4) + 

< Pnrefl(P,4) + ie Pnrefl(P,rP‘(P)) . 

In the last inequality we used Lemma 6. It follows that 

(1-e) Pnrefl(P,rP‘(P)) < Pnrefl(P,4) , 

which completes the proof of Theorem 1 . 

5 Concluding Remarks 

Lopez and Reisner proposed a different algorithm for approximating a convex n- 
gon by an inscribed polygon with few vertices [11]. They obtain in time 0(n+{n— 
k) log n) an approximating polygon with k vertices and relative approximation 
error 0{l/k^). Our method (see Lemma 2) improves on it as we obtain the same 
approximation error, with the same number of vertices, in time 0{n). When the 
vertices of P are given in a sorted array, our time bound becomes O(fclogn). 

As mentioned before, the problem solved by Theorem 1 is a special case of 
the problem of maximizing the overlap of two convex polygons P and Q under 
translation and rotation of Q. Surprisingly little is known about this problem. 
Alt et al. [2] made some initial progress on a similar problem, showing, for 
instance, how to construct, for a convex polygon P, the axis-parallel rectangle Q 
minimizing the symmetric difference of P and Q. 
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Our solution does not generalize to this problem. We make use of two special 
properties of our problem. First, as discussed before, we know that the area of 
the optimal solution is within a constant factor the area of P. Second, we have 
a specific argument if P is very long and skinny. 



References 

1. P. K. Agarwal, S. Har-Peled, K. R. Varadarajan: Approximating Extent Measures 
of Points, manuscript 2003. 

2. H. Alt, J. Blomer, M. Godau, and H. Wagener: Approximation of convex polygons, 
Ptoc. 17th Internat. Colloq. Automata Lang. Program., Lecture Notes Comput. 
Sci. 443, p. 703-716, Springer- Verlag 1990. 

3. D. Avis, P. Bose, G. Toussaint, T. Shermer, B. Zhu, J. Snoeyink: On the sectional 
area of convex polytopes, Proc. 12th ACM Symp. Comput. geometry, (1996) 411- 
412. 

4. G. Barequet, V. Rogol: Maximizing the area of an axially-symmetric polygon in- 
scribed by a simple polygon, manuscript 2003. 

5. M. de Berg, O. Devillers, M. van Kreveld, O. Schwarzkopf, M. Teillaud: Comput- 
ing the maximum overlap of two convex polygons under translations. Theory of 
Computing Systems 31 (1998) 613-628. 

6. R. M. Dudley: Metric entropy of some classes of sets with differentiable boundaries, 
J. Approximation Theory 10 (1974) 227-236; Erratum in J. Approx. Theory 26 
(1979) 192-193. 

7. B. Griinbaum: Measures of symmetry of convex sets, in ‘Convexity’, V. Klee, ed., 
Proc. Symp. Pure Math 7, Amer. Math. Soc. 1963, 233-270. 

8. F. John: Extremum problems with inequalities as subsidiary conditions, in 
‘Courant Anniversary Volume’, 1948, 187-204. 

9. M. Lassak: Approximation of convex bodies by axially symmetric bodies, Proc. 
Amer. Math. Soc. 130 (2002) 3075-3084. Erratum in Proc. Amer. Math. Soc. 131 
(2003) p.2301 

10. M. Lassak: Approximation of convex bodies by rectangles, Geometriae Dedicata 
47 (1993) 111-117. 

11. M.A. Lopez, S. Reisner: Efficient approximation of convex polygons, Internat. J. 
Comput. Geom. Appl. 10 (2000) 445-452. 

12. P. Preparata, M.I. Shamos: Computational Geometry: An Introduction, Springer- 
Verlag 1985. 

13. O. Schwarzkopf, U. Fuchs, G. Rote, E. Welzl: Approximation of convex figures by 
pairs of rectangles, Comput. Geom. Theory Appl. 10 (1998) 77-87; also in STAGS 
1990, LNCS 415, p. 240-249. 

14. B. A. deValcourt: Axially symmetric polygons inscribed in and circumscribed about 
convex polygons, Elemente Math. 22 (1967) 121-133. 




The Traveling Salesman Problem with Few Inner Points 



Vladimir G. Demeko^ *, Michael Hoffmann^, 

Yoshio Okamoto^’ **, and Gerhard J. Woeginger^ * 

^ Warwick Business School, The University of Warwick, 

Conventry CV4 7AL, United Kingdom 
orsvdOwbs . Warwick . ac . uk 

^ Institute of Theoretical Computer Science, ETH Zurich, CH-8092 Zurich, Switzerland 
{hof fmann, okamotoy}®inf . ethz . ch 
® Department of Mathematics and Computer Science, TU Eindhoven, 

P.O. Box 513, 5600 MB Eindhoven, The Netherlands 
gwoegi@win . tue . nl 



Abstract. We study the traveling salesman problem (TSP) in the 2-dimensional 
Euclidean plane. The problem is NP-hard in general, but trivial if the points are 
in convex position. In this paper, we investigate the influence of the number of 
inner points (i.e., points in the interior of the convex hull) on the computational 
complexity of the problem. We give two simple algorithms for this problem. The 
first one runs in 0{k\kn) time and 0(fc) space, and the second runs in 0(2^ k^n) 
time and 0{2^kn) space, when n is the total number of input points and k is 
the number of inner points. Hence, if k is taken as a parameter, this problem is 
fixed-parameter tractable (FPT), and also can be solved in polynomial time if 
k — 0(log n). We also consider variants of the TSP such as the prize-collecting 
TSP and the partial TSP in this setting, and show that they are FPT as well. 



1 Introduction 

The traveling salesman problem (TSP) is one of the most famous optimization prob- 
lems, which comes along many kinds of applications such as logistics, scheduling, 
VLSI manufacturing, etc. In many practical applications, we have to solve TSP in- 
stances arising from the two-dimensional Euclidean plane, which we call the 2DTSP. 
Also most of the benchmark instances for TSP belong to this class. Theoretically speak- 
ing, the general 2DTSP is strongly NP-hard [9, 15]. On the other hand, the problem is 
trivial if the points are in convex position, i.e., they are the vertices of a convex polygon. 
Therefore, the following natural question is asked; What is the influence of the number 
of inner points on the complexity of the problem? Here, an inner point of a finite point 
set P is a point from P which lies in the interior of the convex hull of P. Intuitively, we 
might say that “Fewer inner points make the problem easier to solve.” 

In the following, we answer this question and support the intuition above by provid- 
ing simple algorithms based on the dynamic programming paradigm. The first one runs 

* Supported by the NWO project 613.000.322 “Exact Algorithms”. 

** Supported by the Berlin-Zurich loint Graduate Program “Combinatorics, Geometry, and Com- 
putation” (CGC), financed by ETH Zurich and the German Science Foundation (DEG). 

K.-Y. Chwa and J.I. Munro (Eds.): COCOON 2004, LNCS 3106, pp. 268-277, 2004. 

(c) Springer- Verlag Berlin Heidelberg 2004 



The Traveling Salesman Problem with Few Inner Points 



269 



in 0{k\kn) time and 0(fc) space, and the second runs in 0{2^k^n) time and 0{2^kn) 
space, where n is the total number of input points and k is the number of inner points. 
Hence, from the viewpoint of parameterized complexity [7, 14], these algorithms are 
fixed-parameter algorithms', when the number of inner points is taken as a parameter, 
hence the problem is fixed-parameter tractable (FPT). Observe that the second algo- 
rithm gives a polynomial-time solution to the problem when k = 0(log n). 

We also study two variants of the traveling salesman problem, both also strongly 
NP-hard: the prize-collecting traveling salesman problem, introduced by Balas [2], and 
the partial traveling salesman problem. We show that these problems on the Euclidean 
plane are FPT as well. 

Related work. Since the literature on the TSP and its variants is vast, we only point out 
studies on the TSP itself which are closely related to our result. To the authors’ knowl- 
edge, only few papers studied the parameterized complexity of the 2DTSP. Probably 
the most closely related one is a paper by Deineko, van Dal and Rote [5]. They studied 
the 2DTSP where the inner points lie on a line. The problem is called the convex-hull- 
and-line TSP. They gave an algorithm running in O(fcn) time, where k is the number of 
inner points. Deineko and Woeginger [6] studied a slightly more general problem called 
the convex-hull-and-i-line TSP, and gave an algorithm running in 0{k^n?) time. Com- 
pared to these results, our algorithms deal with the most general situation, and are fixed- 
parameter algorithms with respect to k. As for approximation algorithms, Arora [1] and 
Mitchell [13] found polynomial-time approximation schemes (PTAS) for the 2DTSP. 
Rao and Smith [16] gave a PTAS with better running time 0(nlogn -F 
As for exact algorithms. Held and Karp [10] and independently Bellman [3] provided 
a dynamic programming algorithm to solve the TSP optimally in 0(2”n^) time and 
0(2"n) space. For geometric problems, Hwang, Chang and Lee [12] gave an algorithm 
to solve the 2DTSP in time based on the so-called separator theorem. 

Organization. The next section introduces the problem formally, and gives a fundamen- 
tal lemma. Sect. 3 and 4 describe algorithms for the 2DTSP. Variations are discussed in 
Sect. 5. We conclude with an open problem at the final section. 



2 Traveling Salesman Problem with Few Inner Points 

Let P C IR^ be a finite point set in the Euclidean plane. The convex hull of P is the 
smallest convex set containing P. A point p G P is called an inner point if p lies in the 
interior of the convex hull of P. We denote by Inn(P) the set of inner points of P. A 
point p G P is called an outer point if it is not an inner point, i.e., it is on the boundary 
of the convex hull of P. We denote by Out(P) the set of outer points of P. Note that 
P = Inn(P) U Out(P) and Inn(P) n Out(P) = 0. Let n := |P| and k := |lnn(P)|. 
(So, we have |Out(P)| = n—k.) 



* A fixed-parameter algorithm has running time 0(/(fc)poly(n)), where n is the input size, k is 
a parameter and / : IN — > IN is an arbitrary function. For example, an algorithm with running 
time O(440''n) is a fixed-parameter algorithm whereas one with 0{n^) is not. 



270 



Vladimir G. Deineko et al. 



A tour on P is a linear order (xi , X 2 , . . . , x„) of the points in P. We say that this 
tour starts at x\. We often identify the tour {x \, . . . , x„) on P with a closed polygonal 
curve consisting of the line segments 'xix^, X 2 X 3 , . . Xn-iXn, XnX\. The length of the 
tour is the Euclidean length of this polygonal curve, i.e., d{xi, Xi+i) + d{xn, a^i), 

where d{xi,Xi+i) stands for the Euclidean distance from Xi to Xi+\. The objective of 
the TSP is to find a shortest tour. The following lemma was probably first noted by 
Flood [ 8 ] and nowadays it is folklore. 

Lemma 1 (Flood [ 8 ]). Every shortest tour has no crossing. 

This lemma immediately implies the following lemma, which plays a fundamental 
role in our algorithm. We call a linear order on Out(P) cyclic if every two consecutive 
points in the order are also consecutive on the convex hull of P. 

Lemma 2. In every shortest tour on P, the points o/Out(P) appear in a cyclic order. 

With Lemma 2, we can establish the following naive algorithm: take an arbitrary 
cyclic order on Out(P), then look through all tours (i.e., the linear orders) tt on P 
which respect^ this cyclic order. Compute the length of each tour, and output the best 
one among them. The number of such tours is 0(fc!n^). Therefore, the running time 
of this algorithm is So, if k is constant, this algorithm runs in polynomial 

time. However, it is not a fixed-parameter algorithm with respect to k since k appears 
in the exponent of n. 



3 First Fixed-Parameter Algorithm 

First, let us remark that later on we always assume that, when P is given to an algorithm 
as input, Out(P) is distinguished together with a cyclic order 7 = (pi, . . . ,p„_fc). 
Also, note that the space complexity in the algorithms below do not count the input 
size, as usual in theoretical computer science. 

Our first algorithm uses the following idea. We look through all linear orders on 
Inn(P). Let us first fix a linear order tt on Inn(P). We will try to find a shortest tour 
on P which respects both the cyclic order 7 on Out(P) and the linear order tt on 
Inn(P). Then, we exhaust this procedure for all linear orders on Inn(P), and output a 
minimum one. Later we will show that we can compute such a tour in O(fcn) time and 
0{k) space. Then, since the number of linear orders on Inn(P) is kl and they can be 
enumerated in amortized 0 ( 1 ) time per one with 0 (fc) space [17], overall the algorithm 
runs in 0 {klkn) time and 0 (fc) space. 

Now, given a cyclic order 7 on Out(P) and a linear order tt on Inn(P), we de- 
scribe how to compute a shortest tour among those which respect 7 and tt by dynamic 
programming. For dynamic programming in algorithmics, see the textbook by Cormen, 
Leiserson, Rivest and Stein [4], for example. 

We consider a three-dimensional array Fi[i,j,m], where i G {1, . . . ,n—k}, j G 
{0, 1, . . . , k}, and m G {Inn, Out}. The first index i represents the point in Out(P), 



^ For a set S and a subset S' C S', we say a linear order tt on S respects a linear order tt' on S' 
if the restriction of tt onto S' is tt' . 
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the second index j represents the point qj in Inn(P), and the third index m repre- 
sents the position. The value Fi[i,j,m] represents the length of a shortest “path” on 
{pi, . . . ,pi} U {gi, . . . , Qj} that satisfies the following conditions. 

- It starts at Pi S Out(P). 

- It visits exactly the points in {pi, . . . ,p^} U {gi, . . . , qj}. (If j = 0, set {gi, . . . , 
Qj} = 0 -) 

- It respects the orders 7 and tt. 

- If m = Out, then it ends at pi (an outer point). If to = Inn, then it ends at qj (an 
inner point). 

Then, the length of a shortest tour respecting tt and 7 can be computed as 

mm{Fi[n-k, k, Out] + d[pn-k,Pi], Fi[n-k, k, Inn] + d{qk,Pi)}- 

Therefore, it suffices to know the values Fi [j, j, to] for all possible j, j, to. 

To do that, we establish a recurrence. First let us look at the boundary cases. 

- Since we start atpi, set Fi[l, 0, Out] = 0. 

- There are some impossible states for which we set the values to 00 . Namely, for 

every j € {1, . . . , fc} set Fi [1, j, Out] = 00 , and for every i € n—k} set 

Fi[i, 0, Inn] = 00 . 

Now, assume we want to visit the points of {pi, . . . ,pi} U {gi, . . . , g^} while re- 
specting the orders 7 and tt and arrive at qj. How can we get to this state? Since 
we respect the orders, either (1) first we have to visit the points of {pi, . . . ,pi} U 
{gi, . . . , qj-i} to arrive atpi then move to qj, or ( 2 ) first we have to visit the points of 
{pi, . . . ,Pi} U {gi, . . . , qj-i} to arrive at qj_i then move to qj. Therefore, we have 

Fi[i,j, Inn] = min{Fi [i, j-1, Out] + d{pi,qj),Fi[i,j-l, Inn] +d{qj_i,qj)} (1) 

for (i,j) G {1, . . . , n—k} x {1, . . . , k}. Similarly, we have 

Out] = min{F’i[i-l, j, Out] -F d{p^-i,p^), Fi[i-1, j, Inn] -F d{qj,pi)} (2) 

for (i,j) G { 2 , . . . , n—k} x { 0 , . . . , k}, where d{qo,pi) is considered 00 for conve- 
nience. Since what is referred to in the right-hand sides of Equation (1) and (2) has 
smaller indices, we can solve this recursion in a bottom-up way by dynamic program- 
ming. This completes the dynamic-programming formulation for the computation of 

Fi[i,j,m]. 

The size of the array is {n—k) x (A:-Fl) x 2 = 0{kn), and the computation of each 
entry requires to look up at most two other entries of the array. Therefore, we can fill 
up the array in 0{kn) time and O(fcn) space. 

Now, we describe how to reduce the space requirement to 0{k). For each (i,j) G 
{1, . . . , n—k} X {1, . . . , k}, consider when Fi[i,j, Inn] is looked up throughout the 
computation. It is looked up only when we compute F’i[i,j-Fl, Inn] andFi[i-Fl, 7 , Out]. 
So the effect of Fi [i,j, Inn] is local. Similarly, the value Fi [i,j, Out] is looked up only 
when we compute j-Fl, Inn] and F’i[j-Fl, j, Out]. We utilize this locality in the 
computation. 
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We divide the computation process into some phases. For every i G n—k}, 

in the i-th phase, we compute Fi[j, j, Inn] and Out] for all j G {0, . . . , k}. 

Within the i-th phase, the computation start with Fi[i,l,Out] and proceed along 
Fi [i, 2, Out], Fi[i, 3, Out], . . ., until we get Fi [i, A:, Out], Then, we start calculating 
Fi [z, 1, Inn] and proceed along Fi[i, 2, Inn], Fi [i, 3, Inn], . . ., until we get [i, /c. Inn]. 
From the observation above, all the computation in the i-th phase only needs the out- 
come from the (i— l)-st phase and the i-th phase itself. Therefore, we only have to store 
the results from the (i— l)-st phase for each i. This requires only 0(k) storage. 

In this way, we obtain the following theorem. 

Theorem 1. The 2DTSP on n points including k inner points can be solved in 0{k\kn) 
time and 0(fc) space. In particular, it can be solved in polynomial time ifk = 0(log n/ 
log log n). □ 

4 Second Fixed-Parameter Algorithm with Better Running Time 

To obtain a faster algorithm, we make use of the trade-off between the time complexity 
and the space complexity. Compared to the hrst algorithm, the second algorithm has a 
better running time 0(2^fc^n) and needs a larger space 0(2^fcn). The idea of trade-off 
is also taken by the dynamic programming algorithm for the general traveling sales- 
man problem due to Bellman [3] and Held & Karp [10], and our second algorithm is 
essentially a generalization of their algorithms. 

In the second algorithm, first we fix a cyclic order 7 on Out(P). Then, we imme- 
diately start the dynamic programming. This time, we consider the following three- 
dimensional array ^ 2 ]*, S, r], where i G {1, . . . , n—k}, S C Inn(P), and r G SU {pi}. 
We interpret F 2 [i, S, r] as the length of a shortest path on {pi , . . . ,pi}U S that satisfies 
the following conditions. 

- It starts at Pi G Out(P). 

- It visits exactly the points in {pi , . . . , p^} U S'. 

- It respects the order 7 . 

- It ends at r. 

Then, the length of a shortest tour can be computed as 

min{F 2 [n-k, lnn(P),r] -|- d(r,pi) ] r G Inn(P) U {p„_fe}}. 

Therefore, it suffices to know the values ^ 2 ]*, S, r] for all possible triples (z, S, r). 

To do that, we establish a recurrence. The boundary cases are as follows. 

- Since we start atpi, set ^ 2 ]!, 0,Pi] = 0. 

- Set F 2 [1, S, r] = c» when S 7 ^ 0, since this is an unreachable situation. 

Let z G {2, . . . , n—k} and S C Inn(P). We want to visit the points of {pi, . . . ,pi}U 

5 while respecting the order 7 and arrive at p^. How can we get to this state? Since we 
respect the order 7 , we hrst have to visit the points of {pi , . . . , Pi-i } U S' to arrive at a 
point in S U {pz-i} and then move to pi. Therefore, we have 

F 2 [i,S,pi\= min {F 2 [i-l, S,t] + d{t,pi)) (3) 

tGSU{pi_i} 
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for i S {2, . . . , n—k} and S C Inn(P). Similarly, we have 

F 2 [i,S,r\= min S' \ W, f] + d(f, r)) (4) 

tG(S\{r})u{pi} 

for i G {2, . . . , n—k}, S C Inn(P), S ^ 0 and r G S. This completes the dynamic- 
programming formulation for the computation of F 2 [i, S, r] . 

The size of the array in this algorithm is (n—k) X^s=o (s)'® “ 0{2^kn), and the 
computation of each entry requires to look up 0(fc) other entries. Therefore, we can fill 
up the array in 0(2^fc^n) time and in 0{2^kn) space. Thus, we obtain the following 
theorem. 

Theorem 2. The 2DTSP on n points including k inner points can be solved in 0{2^ k'^n) 
time and 0{2^kn) space. In particular, it can be solved in polynomial time if k = 
O(logn). □ 

5 Variants of the Traveling Salesman Problem 

Since our approach to the TSP in the previous section is based on the general dynamic 
programming paradigm, it is also applicable to other variants of the TSP. In this section, 
we discuss two of them. 

5.1 Prize-Collecting Traveling Salesman Problem 

In the prize-collecting TSP, we are given an n-point set P C ]R^ with a distinguished 
point h G P called the home, and a non-negative number c(p) G K for each point 
p G P which we call the penalty of p. The goal is to find a subset P' C P\ {h} and a 
tour on P'U{h} starting at h which minimizes the length of the tour minus the penalties 
over all p G P' U {/i}. In this section, the value of a tour (or a path) refers to the value 
of this objective. 

For this problem, we basically follow the same procedure as the TSP, but a little 
attention has to be paid because in this case we have to select some of the points from 
P. In addition, we have to care about the position of h, in Inn(P) or in Out(P). 

First Algorithm. First, let us consider the case h G Out(P). In this case, we fix a 
cyclic order 7 on Out(P), which starts with h, and we look through all linear orders 
on Inn(P). Let 7 = {pi,p 2 , ■ ■ ■ ,pn-k), where pi = h, and fix one linear order tt = 
{qi,q 2 , ■ ■ ■ ,qk) on Inn(P). Then, we consider a three-dimensional array Fi[i,j,m], 
where i G {1, . . . , n—k}, j G {0, 1, . . . , fc} and m G {Inn, Out}. The value Fi\i,j, m] 
is interpreted as the value of an optimal path on jpi , . . . ,pi} U {qi, . . . , qj} which 
satisfies the following conditions. 

- It starts at Pi G Out(P). 

- It visits some points from jpi , . . . ,pi} U {qi, . . . , qj}, and not more. (If j = 0, set 
{qi,...,qj} = 0.) 

- It respects the orders 7 and tt. 

- If TO = Out, then it ends at pi. If to = Inn, then it ends at qj. 

We want to compute the values Pi [i,j, to] for all possible triples {i, j, to). 
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The boundary cases are: Fi[l,j, Out] = — c(pi) for every j G {1, . . . , k}-, and 
Fi [z, 0, Inn] = oo for every z G {1, . . . , n—k}, and the main part of the recurrence is: 

Fi[z,j, Inn] =min{ min (Fi[T, j-1, Out] + d(p*/, - c(g^)), 

min {Fi[ij',\nn] + d{qf,qj) - c{qj))} 

for (z, j) € n—k} x k}, and 

Fi[z, j,Out] =min{ min (Fi[i' , j , Out] + d{pi, , p,) - c{pi)) , 
min {Fi[i-l,j',\nn]+d{qj,,p,) - c{pi))} 

for (z, j) G {2, . . . , n—k} x {0, . . . , k}. For convenience, d{qQ,pi) is considered to be 
00 . 

Then, the value of an optimal prize-collecting tour respecting tt and 7 can be com- 
puted as 

mini min {Fi[i,k,0ut] + d{pi,pi)), min {Fi[n—k,j,\nn] + d{qj,pi))}. 

Since the size of the array is O(fczz) and each entry can be hlled by looking up 0(zz) 
other entries, the running time is 0{kn^). Therefore, looking through all linear orders 
on Inn(P), the overall running time of the algorithm is 0{klkn^). 

Next, let us consider the case h G Inn(P). In this case, we look through all linear 
orders on Inn(P) which start with h, and also all cyclic orders on Out(P). Fix one 
linear order tt = (gi, 92 , • ■ • , 9fc) on Inn(P), where qi = h, and one cyclic order 7 = 
{pi,P 2 , ■ ■ ■ ,Pn-k) on Out(P). Then, we consider a three-dimensional array Fi [i,j, m], 
where z G {0, 1, . . • , n—k}, j G {1, . . . , fc} and m G {Inn, Out}. The interpretation 
and the obtained recurrence is similar to the first case, hence omitted. However, in this 
case, the number of orders we look through is 0{kln). Therefore, the overall running 
time of the algorithm in this case is 0{klkn^). Thus, we obtain the following theorem. 

Theorem 3. The prize-collecting TSP in the Euclidean plane can be solved in 
O(klkn^) time and O(fcrz) space, when n is the total number of input points and k 
is the number of inner points. In particular, it can be solved in polynomial time if 
k = 0 (log n/ log log n). □ 

Second Algorithm. Now we adapt the second algorithm for the 2DTSP to the prize- 
collecting TSP. Let us consider the case h G Out(P). (The case h G Inn(P) can be 
handled in the same way.) For a cyclic order 7 = (pi, . . . ,Pn-k) on Out(P) with 
Pi = h, we define a three-dimensional array ^ 2 ]*, S, r] where z G (1, . . . , n—k}, S C 
Inn(P) and r G S' U {pi}. We interpret F 2 [i, S, r] as the value of an optimal path on 
jpi, . . . ,pi} U S that satishes the following conditions. 

- It starts at Pi G Out(P). 

- It visits some points of jpi , . . . ,pi}U S. 

- It respects the order 7 . 

- It ends at r. 
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Then, the value of an optimal tour can he computed as 

Tmii{F 2 [n—k, Inn(P), r] + d{r,pi) \ r G P}. 

The boundary cases are: T2[l,0,Pi] = — c(pi); F2[1,S', r] = oo when S ^ The 
main part of the recurrence is 

F 2 [i,S,pi\= min (F2[i-1, S', f] + - c(pi)) 

tGSU{pi_i} 

for i G {2, . . . , n—k} and S C Inn(P); and 

F 2 [i,S,r\= min (F 2 [i, S \ {r},t] + d{t,r) - c{r)) 

te{S\{r})U{pi} 

for i G {2, . . . , n—k}, S C Inn(P) and r G S. Then, we see that the computation can 
be done in 0(2^fc^n) time and 0(2^fcn) space. 

Theorem 4. The prize-collecting TSP in the Euclidean plane can be solved in 0(2^fc^n) 
time andO{2^kn) space, when n is the total number of input points and k is the number 
of inner points. In particular, it can be solved in polynomial time ifk = 0(log n). □ 

5.2 Partial Traveling Salesman Problem 

In the i-partial TSP^, we are given an n-point set P C with a distinguished point 
h G P called the home, and a positive integer i < n. We are asked to find a shortest 
tour starting from h and consisting of £ points from P. 

Because of space limitations, we do not give an adaptation of the first algorithm for 
the TSP, although it is possible but more tedious. So, we just describe a variation of the 
second algorithm. 

Second Algorithm. Similarly to the prize-collecting TSP, we distinguish the cases 
according to the position of h, in Inn(P) or in Out(P). Here we only consider the case 
h G Out(P). (The case h G Inn(P) is similar.) Fix a cyclic order 7 = (pi, . . . ,pn-k) 
on Out(P), where pi = h. We consider a four-dimensional array P2[i, S, r, m], where 
i G {1, . . . , n—k}, S C Inn(P), r G S'Ujpi}, and to G {1, . . . , £}. Then, F 2 [i, S, r, to] 
is interpreted as the length of a shortest path that satisfies the following conditions. 

- It starts at Pi G Out(P). 

- It visits exactly to points of {pi, . . . ,pi} U S'. 

- It respects the order 7. 

- It ends at r. 

Note that the fourth index to represents the number of points which have already been 
visited. Then, the length of a shortest tour through i points is 

min{P2[i, lnn(P),r,£] -F d{r,pi) | i G {1, . . . , n-k},r G Inn(P) U {p*}}. 

Therefore, it suffices to compute the values P2 [i, S, r, to] for all possible i,S,r, to. 



^ Usually the problem is called the A:-partial TSP. However, since k is reserved for the number 
of inner points in the current work, we will use £ instead of k. 
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The boundary cases are: F 2 [j, S', r, 1] = 0 if z = 1 and r = \ otherwise F 2 [i, S, r, 

1] = 00 . Also, F 2 [1, S, pi, m] = 00 for m > 1. The main part of the recurrence is: 

F 2 [i,S,pi,m\ = min {F 2 [i-l, S,t,m-l] + d{t,pi)) 

tGSU{pi_i} 

for i € {2, . . . , n—k}, S C Inn(P) and m € {2, 

F 2 [z,S, r, m]= min {F 2 [i, S \ {r},t,m—l] + d{t,r)) 

tG{S\{r})U{pi} 

for z € n—k}, S C Inn(P), r G S and m G {2, . . . , f}. 

Although the size of the array is 0{2^kin), we can reduce the space requirement 
to 0{2^kn) because of the locality with respect to the fourth index m. In this way, we 
obtain the following theorem. 

Theorem 5. The (-partial TSP for the Euclidean plane can be solved in 0{2^k^tn) 
time and 0{2^kn) space, where n is the total number of input points and k is the num- 
ber of inner points. In particular, it can be solved in polynomial time ifk = 0(log n). 



6 Concluding Remarks 

We have investigated the influence of the number of inner points in a two-dimensional 
geometric problem. Our results support the intuition “Fewer inner points make the prob- 
lem easier to solve,” and nicely “interpolate” triviality when we have no inner point and 
intractability for the general case. This interpolation has been explored from the view- 
point of parameterized complexity. Let us remark that the results in this paper can also 
be applied to the two-dimensional Manhattan traveling salesman problem, where the 
distance is measured by the £i-norm. That is because Lemmata 1 and 2 are also true 
for that case. More generally, our algorithms solve any TSP instance (not necessarily 
geometric) for which n—k points have to be visited in a specifled order. 

To the authors’ knowledge, this work is the first paper that deals with parameter- 
ized problems in terms of the number of inner points. Study of other geometric prob- 
lems within this parameterized framework is an interesting direction of research. Within 
this framework, the minimum weight triangulation problem was recently studied, and a 
fixed-parameter algorithm with respect to the number of inner points was given [11]. 

The major open question is to improve the time complexity 0{2^kfn). For example, 
is there a polynomial-time algorithm for the 2DTSP when k = 0(log^ n)? 
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Abstract. We design a faster algorithm for the all-pairs shortest path 
problem under the RAM model, based on distance matrix multiplica- 
tion (DMM). Specifically we improve the best known time complexity of 
0(n®(loglogn/logn)^'^^) to T{n) = 0(n® (log log n)^/ log n). We extend 
the algorithm to a parallel algorithm for DMM, whose time complexity is 
O(logn) and number of processors is T(n)/ log n. As an application, we 
show how to speed up the algorithm for the maximum subarray problem. 



1 Introduction 

In this paper we consider the all-pairs shortest path (APSP) problem, which 
computes shortest paths between all pairs of vertices of a directed graph with 
non-negative real numbers as edge costs. We present an algorithm that computes 
shortest distances between all pairs of vertices, since shortest paths can be com- 
puted as by-products in our algorithm. It is well known that the time complexity 
of (n, n)-distance matrix multiplication (DMM) is asymptotically equal to that 
of the APSP problem for a graph with n vertices, as shown in [1, pp 202-206]. 
Thus we concentrate on DMM in this paper. 

Fredman [4] was the first to break the cubic complexity of O(n^) under 
RAM, giving 0(n^(loglogn/ logn)^/^). Takaoka [8] improved this complexity to 
0(n^(loglogn/logn)^/^). In this paper we improve the complexity further to 
0(n^(loglogn)^/logn). The above mentioned complexities are all in the worst 
case. If we go to the average case, we can solve the APSP problem in 0(n^ log n) 
time [7] . If edge costs are small integers, the complexity becomes more subcubic, 
i.e., 0(n^~'^) for some e > 0, as shown in [9], [2], and [12]. 

We follow the same framework as those in [4] and [8]. That is, we take a 
two level divide-and-conquer approach. To multiply the small matrices resulting 
from dividing the original matrices, we sort distance data, and use the ranks of 
those data in the sorted lists. As the ranks are small integers, the multiplication 
can be done efficiently by looking at some precomputed tables. 

In Section 2, we describe the two level divide-and-conquer approach. In Sec- 
tion 3, we show how to use ranks to compute DMM. In Section 4, we describe 
the encoding scheme using the small integers of the ranks. In Section 5, we show 
how to compute the tables for the efficient multiplication. In Section 6, we show 
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how to construct DMM by table look-up. In Section 7, we analyze the computing 
time. In Section 8, we parallelize our algorithm for DMM. Section 9 discusses 
how to improve algorithms, sequential and parallel, for the maximum subarray 
problem based on previous sections. Section 10 concludes the paper. 

2 Review of Distance Matrix Multiplication 

The distance matrix multiplication is to compute the following distance product 
C = AB for two (n, n)-matrices A = [aij] and B = [bij] whose elements are real 
numbers. 



Cij = + bkj}, {i,j = 1, (1) 

This is also called funny matrix multiplication in some literatures such as [2]. 
The operation in the right-hand side of (1) is called distance matrix multiplica- 
tion, and A and B are called distance matrices in this context. This is also viewed 
as computing inner products defined for a = (oi, ..., a„) and b = (6i, ..., 6„) 

by 

ab = min'^^i{ak + bk} 

Now we divide A, B, and C into (m, m)-submatrices for N = n/m as follows: 



^1,1 ^i,Af \ I Bi i ... Bi ]\[ \ / Ci^i 

,An^i ... Am.n j \Bn,i ... Bm,n j \Cn,i 

Then C can be computed by 

Cij — vniifi^^.^\^AikBkj'\{i^ j — 1, 



Ci.w 

Cn.n , 



(2) 



where the product of submatrices is defined similarly to (1) and the “min” oper- 
ation is defined on submatrices by taking the “min” operation componentwise. 
Since comparisons and additions of distances are performed in a pair, we omit 
counting the number of additions for measurement of the complexity. We have 
multiplications of distance matrices in (2). Let us assume that each multi- 
plication of (m, m)-submatrices can be done in T(m) computing time, assuming 
precomputed tables are available. The time for constructing the tables is reason- 
able when m is small. Then the total time excluding table construction is given 

by 

0(r? jm -I- {nlm)^T{m)) 

In the subsequent sections, we show that T{m) = 0(m^(mlogm)^/^). Thus 
the time becomes 0(n^(log 

Now we further divide the small (m, m)-submatrices into rectangular matri- 
ces in the following way. We rename the matrices Aik and Bkj in (2) by A and B. 
Let M = m/l, where 1 < I < m. Matrix A is divided into M (m, Z)-submatrices 
Ai, ..., Am from left to right, and B is divided into M (Z, m)-submatrices Bi, 
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Bm from top to bottom. Note that Ak are vertically rectangular and Bk are 
horizontally rectangular. Then the product C = AB can be given by 

C = (3) 

The values of m and I will be determined in subsequent sections to achieve 
the claimed complexity. 



3 How to Multiply Rectangular Matrices 

We rename again the matrices Ak and Bk in (3) by A and B. In this section we 
show how to compute AB, that is, 

min’'^^i{air + brj} , for z = 1, ..., m; j = 1, ..., m. (4) 

We assume that the lists of length m 

{air - ais, ..., Umr ~ Oms), {1 < T < S < 1) 



and 



(6si brl, ..., bsm brm)j (1 ^ r < S ^ 



are already sorted for all r and s such that 1 < r < s < ?). The time for sorting 
will be mentioned in Section 7. Let Ers and Frs be the corresponding sorted lists. 
For each r and s, we merge lists Ers and Ers to form list Grs- Let Hrs be the list 
of ranks oi air — a^s (z = 1, ..., m) in Grs and Lrs be the list of ranks of bsj — brj 
{j = in Grs- Let Hrs[i] and Lrs[j] be the zth and jth components of 

Hrs and Lrs respectively. Then we have 



Lfrs [-^rs [t]] a^r ^ist Lrrs [-Lrs [j]] bsj brj 

The lists Hrs and Lrs for all r and s can be made in 0{Pm) time, when the 
sorted lists are available. 

We have the following obvious equivalence. 



brj "El ^is bsj k CZzr ^zs Li bsj brj 

and 

air 0-is "E: bsj brj ^ -Lrs[j] 

Fredman [4] observed that the information of ordering for all z, j, r, and s 
in the righthand side of the above formula is sufficient to determine the product 
AB by a precomputed table. This information is essentially packed in the three 
dimensional space of Hrs[i]{i = l,...,m;r = l,...,/;s = r+1, ...,/), and Lrs[j]{j = 
1, ..., m; r = 1, ..., 1; s = r + 1 , ..., /). We call this the three-dimensional packing. 

Takaoka [8] proposed that to compute each (z, j) element of AB, it is enough 
to know the above ordering for all r and s. We call this the two-dimensional 
packing. Note that the precomputed table must be obtained within the total 
time requirement. The two-dimensional packing will therefore allow a larger size 
of TO, leading to a speed-up. 

In this paper, we show that a one-dimensional packing scheme is possible. 
To choose the minimum of 
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Xj, — Ctir ~h brj , (r — 1, 

we go through a tournament with I participants. We assume m and I are a 
power of 2. Our theory can be generalized into other cases easily. Specifically we 
compare in the following pairs. We call this the 0th comparison stage. 

(rrO f'rO T-Oi ( 'rO'l 



We call the minimum in each pair the winner of the pair. Suppose the minima 
of those pairs are x\,X 2 , Each Xj has two possibilities of being X 2 j_i or 

X 2 j- Then we compare in the following pairs. We call this the 1st comparison 
stage. 



{x\,x\),{xl,x\),...,{x]/2.-l 




The winner of (xlj_i,xlj) has four possiblities of being for 4j — 3 < i < 4j. 
If we repeat these stages, we can finish in log I — 1 stages. Comparing in the pair 
that is, testing is equivalent to comparing in the pair 

{Hr,r+i[i\i Lr^r+i[j])- Thus if we pack two tuples (i?i2[f], i?34[t], ..., and 

(Ti 2 [j], T 34[j], ..., into single integers, we can know the 1/2 winners in 

0(1) time from a precomputed table in the form of an encoded integer. We 
can take a similar approach for stages 1, 2, ... Thus the time for computing (4) 
becomes O(logZ). The tables for the fcth stage are constructed for all possible 
remaining 1/2^ winners, each of which is one of 2^ participants. The number of 
necessary tables at the kth stage is given by (2^)^/^ . See Fig. 1 for fc = 1. Thus 
the total number of necessary tables, named by T later, is 



1 + 2'/2 -b 4'/4 -b 8'/® -b ... + {l/2f < 2'/2log; 



Since the ranks are between 1 and 2m, the size of each table is bounded by 
(2m)*/^(2m)*/^ = (2m)*. 

The time for computing those tables will be mentioned in Section 7. In the 
following, winners are specified by indices. 

Example 1. Let and Lrs\j] be given in Figure 1. First we have the two 

lists 



(i7l,2M,i73,4[z],i^5.6W,i^7.8W) = (1,5,12,3) 
(Ti,2b1,L3,4b'],T5.6b'],T7.8b1) = (11,15,10,10) 

Since 1 < 11,5 < 15, 12 > 10, and 3 < 10, the winners at the first stage are (1, 
3, 6, 7). The next lists are thus 

(i7i,3[f],i76,7W) = (13,4), (Ly3b],L6.7b1) = (3,7) (shown by asterisks) 

Since 13 > 3 and 4 < 7, the winners at the second stage are (3, 6). The last lists 
are (iLs^eW) = (1) and (Ls^eb]) = (3), from which we conclude 0,3 + bzj is the 
minimum. All tuples above are encoded in single integers in actual computation. 
If we had different H and L, we might have other winners such as (2, 4, 5, 8) at 
the first stage, and (1, 7) at the second stage, etc. All posibilities are shown by 
asterisk or minus signs for fc = 1. We have all preparations for those situations 
in the form of precomputed tables, so each step is done in 0(1) time. 
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H 12345678 



L 12345678 



1 

2 

3 

4 

5 

6 

7 

8 



1 13* 5- 6 7 3 4 1 

12- 4- 5 8 2 5 2 

5 6 1 10 7 3 

2 4 5 1 4 

12 3- 9- 5 

4* 8- 6 

3 7 

8 



11 3* 2- 7 4 5 6 

4-1-5 6 7 8 

15 3 8 1 5 

11 3 7 14 

10 5- 6- 

7* 9- 
10 



Fig. 1. Hra[i] and Lrs\j] for / = 8 



4 Encoding Lists into Integers 

The sequence of k integers Xk-i, •••, x\, xq, where 0 < Xi < — 1 for an integer 

/i, is encoded into an integer by the function such that 

h^(xk-i, ■■;Xi,xo') = Xk-i/J.^~^ + ... + xifi + xo 

It is obvious that the mapping is a bijection. The function can be computed 
in 0{k) time by Horner’s method, assuming that arithmetic operations on inte- 
gers up to /i^ can be done in 0(1) time. We assume later that /x and k are small, 
so that those operations are done in 0(1) time because can be contained in a 
single computer word. In the following we omit the superscript k wherever it is 
clear from the context. We note that the inverse function h~^, that is, decoding, 
can be computed in 0(fc) time by succesive division. If h{xk-i, ■■■,xi,xo) = z, 
we define h~^{z) = {xk-i, ■■■,xi,xo)- We express the jth element of h~^{z) by 
h~^{z)[j], that is, h~^{z)[j] = Xk-j- We express h by hi when we set = I to 
use hi for encoding winners given in indices. We also define mapping h 2 from 
h by setting /i = 2m. Mapping /12 is used for encoding ranks H's and L's. Let 
zi, Z 2 , ■■■, Zr be r winners where r is an even integer. We define mappings / and 
g for 1 < r < ? to determine which ranks to use next after we have those r 
winners. We omit the subscripts i and j from Hrs[i] and Lrs[j] for simplicity in 
the following. 



f {Zl, Z 2 , ■■■, Zr) — {Hzi,Z 2J ■■■■! Hzr-l,Zr) 

Z 2 t ■■■T Zr) = {Lz\,Z 21 ^Z3,Z41 ■■■1 Lzr-l,Zr) 

Those mappings can be computed in 0{r) time. 



5 How to Compute the Tables 

In this section we construct tables T°, T^, .... The role of table is to determine 
winners (a;i, ..., X;/ 2 '^+i ) for th® next stage from the fc-th stage. Let integers a 
and f3 represent encoded forms of ^/2 ranks in H’s and L’s. The role of z is to 




A Faster Algorithm for the All-Pairs Shortest Path Problem 283 



represent winners in an encoded form. At the beginning all are winners. The 
table is defined by 

T°(z, a, P) = hi{xi,X 2 , X 1 / 2 ), 

where xj = 2j — 1 if xj = 2 j otherwise, and z = 

hi(l, 2, ..., /). The value of z has just one possibility. Tables can be 

defined similarly using 1/2 winners, 1/4 winners, ... as follows: 

Let a and /? be encoded forms of 1/4 ranks each and z = hi{z\, Z//2) such 
that 2 j — 1 < Zj < 2j. The value of z has 2*/^ possibilities. is defined by 

T'^{z,a,P) = hi{xi,X 2 ,.:,xi/ 4 ), 

where Xj = Z2j-i if h 2 ^{a)[j] < Xj = Z 2 j otherwise. 

Let a and /? be encoded forms of 1/8 ranks and z = /ii(zi, ..., Z//4) such that 
4 j — 3 < Zj < 4j. The value of z has 4*/^ possibilities. is defined by 

T'^{z,a,P) = hi{xi,X 2 ,-,xi/ 8 ), 

where Xj = Z 2 j-i if xj = Z 2 j otherwise. 

Let a and /? be two ranks, and z = /ii(zi,Z2) such that 1 < Zi < 1/2 and 
1/2 + 1 < Z 2 < 1. The value of z has (1/2)^ possibilities. 

r^°g*“i(z,a,/3) = hi(x) = X, 

where x = Zi if a < P, and x = Z 2 otherwise. 

In these definitions, z indexes different tables, and thus subscripting may be 
more suitable, but we put z in the argument part for notational convenience. 
We also define tables R' and S"’ for 0 < r < ^ by 

R{z) = h 2 {f{h/\z))) , S^{z) = h 2 {g{h/\z))), 

where h/^{z) = (zi, ..., z^). Those tables are precomputed for all possible z. The 
role of R and S'" is to determine which ranks to use for the next stage when 
the r winners are given. 

To compute R{z,a,P) = hi{xi,X 2 , ■■■,xij 4 ), for example, we decode z, a, 
and P in 0 {l) time, then test h 2 ^{a)[j] < h 2 ^{P)[j] to get x\, ...,x//4, and finally 
encode Xi,...,x;/4 spending 0{l) time. We do this for all possible z, a, and p. 
Other tables can be computed similarly in 0{l) time. Tables R and S'’’ can also 
be computed in 0{l) time. The total time for Rs and S’s are not greater than 
that for T’s. 

The total time for computing those tables is thus 0(2*/^ log l{2m)H). Observe 
0 { 2 ’‘^'^\ogl{ 2 m)H) = 0(c"°g™), for some constant c > 0. 

6 Algorithm by Table Look-Up 

Using tables we can compute minr{air + Kj} by repeating the fol- 
lowing log? steps. We start from R(z^) = /i2(Lti,2, and S*(z°) = 

h 2 {Li, 2 , L 3 , 4 , ■■■, Li-ij). The last z'°®* is the solution index. 
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= hi{l,2, (z° was precomputed) 

^1 ^ T^{z°,R\z°),S\z°)) 
z^ = T\z\R^/\z^),S^/\z^)) 



Example 2. The 0-th stage in Example 1 is illustrated as follows: 

2° = /ri(l,2,3,4,5,6,7,8) 

=T°{z°,R^{z°),S^{z°)) 

= T^{z°, /i2(l, 5, 12, 3), 15, 10, 10)) 

= /ii(l,3,6,7) 

i?®(z°) = /i2(77i,2, 773 . 4 , 775.6,^7.8) = /i2(l, 5, 12, 3), 
s^{z^) = ft-2(Ti.2, 7^34, 7 j5_6, 7^7, s)) = ^2(H, 15, 10, 10)). 

In the above /i 2 (l, 5, 12, 3), ft. 2 (H, 15, 10, 10)), hi{l, 3, 6, 7), etc., are to show 
the contents of tuples. They are actually treated as single integers. The argu- 
ments of Ii 2 are ranks, and those of hi are winners. 

The time for this algorithm by table look-up is 0{m‘^ log 1) = 0{m? logm). 
Since there are m/l products A^Bk, we need 0{{m/l)w? log m) time. To compute 
minimum component-wise in min^^\{AkBk\, we need 0{{m/l)m^) time. We 
also need 0{MPm) = 0{lrn?) time for MP mergings as described in Section 3. 

We set I = (to log m) ^7^ to balance the first and the third of the above 
three complexities. Then to compute the product of (to, TO)-matrices, we take 
0{m‘^{mlogm)^^^) time. 

7 How to Determine the Size of Submatrices 

We determine the size of submatrices in Section 2. Let to be given by 
TO = log^ n/ (log^ c(loglogn)^). 

Then we have I < log n/(log clog log n) for sufficiently large n, and the 0(c^^°®"‘) 
time for making the tables can be shown to be 0(n). Substituting the value of to 
for 0(n^(logTO/TO)^7^), we have the overall computing time for distance matrix 
multiplication as 0(n^(loglogn)^/logn). 

We note that the time for sorting to obtain the lists Ers, and Frs for all k 
in Section 3 is 0{MPm\ogm) = 0{m?'^\ogm). This task of sorting, which we 
call presort, is done for all Aij and 73^ in Section 2 in advance, taking 

0{{n log m) = 0(n^ log n/(log log n)^7^)) 

time, which is absorbed in the main complexity. 
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8 Parallel Algorithms 

In this section we design a parallel algorithm for DMM, and then mention how 
to apply the parallel algorithm to the APSP problem. We use a PRAM with 
EREW memory. For parallel computational models, see [1]. The computation of 
DMM consists of two parts. 

(I) Computation of component-wise minima of N matirces. 

(II) Computation of products AikBkj, {i,j, k = 1, N). 

It is well known that there are PRAM-EREW algorithms for finding the min- 
imum of N numbers and broadcasiting a datum to N locations with 0{N / log N) 
processors and 0{\ogN) time. See [6], for example for these algorithms. Let us 
discuss a parallel algorithm for (I) first. Applying the above with N = n/m, we 
have a parallel algorithm for (I) with 0(n^(loglogn)^/log^n) processors and 
O(logn) time, since minima can be computed in parallel. 

Now we design a parallel algorithm for (II). In the computation of AfcRfc, 
there are 0{P) independent mergings of lists of length m. Such computation 
is done for k = in parallel. There are also M — I “min” operations 

on (to, TO-)-matrices. There is a parallel algorithm for those operations with 
0{m‘^l/ logm) = 0(TO^(TOlogTO)^/^/logn) processors and 0{logm) time. The 
tasks of encoding and table look-up computation can be done by a parallel algo- 
rithm withP = 0(TO^(TOlogTO)^/^/logTO) andT = 0{logm). Since products 
in (II) can be computed in parallel, we have a parallel algorithm for (II) with P 
processors and T time, where to = log^ n/(loglogn)^, and 

P = 0{{n/m)^m^{mlogmY^^/logm) = 0(n^ log log n/ log n) 

T = O(logTO-) = O(loglogn) 

Since the processor phase with a larger number of processors can be simulated 
by a smaller number of processors, we can say there is a parallel algorithm for 
DMM with 



P = 0(n^(loglogn)^/(logn)^) 

T = O(logn) 

C = 0(n^(loglogn)^/logn) 

The previous results in [4] and [8] are used in [5] and [10] to establish a parallel 
algorithm for the APSP problem with subcubic cost and 0(log^ n) time. The 
results in this section will improve those results in a similar way. 

9 Application to the Maximum Subarray Problem 

Now we proceed to the maximum subarray problem for an array of size (n,n). 
The cubic algorithm for this problem given by Bentley [3] was improved to sub- 
cubic by Tamaki and Tokuyama [11]. We review the simplified subcubic version 
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in [10]. To ease the description of our algorithm, we give a two-dimensional ar- 
ray a[l..m, l..n] as input data. The maximum subarray problem is to maximize 
the array portion a[k..i,l..j], that is, to obtain such indices (k,l) and (z,j). We 
suppose the upper-left corner has co-ordinate (1,1). 



Example 3. Let a be given by 



1 


2 


-3 


5 


-4 


-8 


3 


-3 


2 


-4 


-6 


-8 


2 


-5 


4 


1 


3 


-2 


9 


-9 


13 


61 


-5 


2 


1 


-3 


5 


-7 


18 


-21 


2 


-6 



Then the maximum subarray is given by the rectangle defined by the upper 
left corner (3, 5) and the lower right corner (4, 6), given by vertical bars. 

We assume that m < n without loss of generality. We also assume that m and 
n are powers of 2. We will mention the general case of m and n later. Bentley’s 
algorithm finds the maximum subarray in 0{m?n) time, which is cubic when 
m = n. 

The central algorithmic concept in this section is that of prefix sum. The 
prefix sums sum[l..n] of a one-dimensional array a[l..n] is computed by 

SMm[ 0 ] := 0 ; 

for f := 1 to n do sum[i] := sum[i — 1] -k a[t]; 

This algorithm can be extended to two dimensions with linear time, the 
details of which are omitted. 

We use distance matrix multiplications of both min and max versions in 
this section. We compute the partial sums s[i,j] for array portions of l..j] 

for all i and j with boundary condition s[t,0] = s[0,j] = 0. These sums are 
often called the two-dimensional prefix sums of a. Obviously this can be done 
in 0{mn) time. The outer framework of the algorithm is given below. Note that 
the prefix sums once computed are used throughout recursion. 

Algorithm M: Maximum subarray 

1. If the array becomes one element, return its value. 

2. Otherwise, if m > n, rotate the array 90 degrees. 

3. Thus we assume m < n. 

4. Let Aieft be the solution for the left half. 

5. Let Aright be the solution for the right half. 

6. Let Acoiumn be the solution for the column-centered problem. 

7. Let the solution be the maximum of those three. 

Here the column-centered problem is to obtain an array portion that crosses 
over the central vertical line with maximum sum, and can be solved in the 
following way. 
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In the above we first fix i and k, and maximize the above by changing I and j. 
Then the above problem is equivalent to maximizing the following for i = 1, m 
and k = 1, i — 1. 

Acolumnli, k] = j]} 

Let s*[i,j] = — s[j, f]. Then the above problem can further be converted into 
Acoiumn[i, k] = {s[i, 1] + s*[Z, k]} + j] + s*[j, fc]}(5) 

The first part in the above is distance matrix multiplication of the min 
version and the second part is of the max version. Let and S 2 be matrices 
whose (i,j) elements are j — 1] and s[i,j + n/2]. For an arbitrary matrix T, 
let T* be that obtained by negating and transposing T. Then the above can be 
computed by multiplying S\ and by the min version and taking the lower 
triangle, multiplying S 2 and S'! by the max version and taking the lower triangle, 
and finally subtracting the former from the latter and taking the maximum from 
the triangle. 

For simplicity, we apply the algorithm on a square array of size (n, n), where 
n is a power of 2. Then all parameters m and n appearing through recursion in 
Algorithm M are power of 2, where m = n or m = n/2. We observe the algorithm 
splits the array vertically and then horizontally. We define the work of computing 
the three Acoiumn’s through this recursion of depth 2 to be the work at level 0. 
The algorithm will split the array horizontally and then vertically through the 
next recursion of depth 2. We call this level 1. 

Now let us analyze the time for the work at level 0. We can multiply (n, n/2) 
and (n/2,n) matrices by 4 multiplications of size (n/2, n/2), and there are two 
such multiplications in (5). We measure the time by the number of comparisons, 
as the rest is proportional to this. Let M{n) be the time for multiplying two 
{nl2,nl2) matrices. At level 0, we obtain an Acoiumn and two AcoiumnA, spend- 
ing 12M(n) comparisons. Thus we have the following recurrence for the total 
time T{n). 

T(1) = 0, T(n) = 4T(n/2)-hl2M(n). 

Theorem 1. Let c he an arbitrary constant such that c > 0. Suppose M{n) 
satisfies the condition M(n) > (4 -|- c)M(ji/2). Then the above T(n) satisfies 

T{n) < 12{1 + A/ c)M{n). 



See [10] for proof. 

Clearly the complexity of 0(n^(log log n)^/ log n) for M (n) satisfies the condition 
of the theorem with some constant c > 0. Thus the maximum subarray problem 
can be solved in 0(n^(loglogn)^/logn) time. Since we take the maximum of 
several matrices component-wise in our algorithm, we need an extra term of 
O(n^) in the recurrence to count the number of operations. This term can be 
absorbed by slightly increasing the constant 12 in front of M{n). 
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If we start from an (m, n) array with m < n, the recursion proceeds without 
a 90-degree rotation until we hit m = n. This part will increase the complexity 
by a factor of log(n/m). 

Suppose n is not given by powers of 2. By embedding the array a in the array 
of size {n' ,n') such that n' is the next power of 2 and the gap is filled with 0, we 
can solve the original problem in the complexity of the same order. 

Now we design a parallel algorithm for the maximum subarray problem based 
on the previous sections, using the PRAM-EREW model. We see all the problems 
of Acoiumn defined through Algorithm M can be computed in parallel. We define 
a region to be a square array portion defined through recursion. At level 0, there 
is one region, 4 at level 1, 16 at level 2, etc. We prepare a processor for each region, 
which we call a task processor. The task of the task processor is to organize the 
computation of the three Acoiumn s defined for each square region similarly to 
level 0. We can use Algorithm M as a parallel device to transmit the information, 
that is, the co-ordinates, of the region for each task processor in parallel, that is, 
through lines 4 and 5. After the transmission is completed, each task processor 
will copy necessary data from array s by employing vn? copying processors, where 
the region is an (to, to) array for some to, and then use the parallel algorithm 
described in the previous section, mobilizing the necessary number of processors 
under its control. Note that each element of array s is seen by 0(log n) copying 
processors. After all the Acoiumn are finished, each task processor can report its 
finding through Algorithm M towards the top level, spending 0(log n) time. The 
time for transmission is obviously within O(logn). The time for computing all 
Acoiumn is dominated at the top level, which is O(logn). Thus the total time is 
0(log n). 

We analyze the total cost. At the top level, we require P{n) = O{n^{\og\ognf / 
(logn)^). At the 1st level, P{n/2) prcessors are used by a task processor, and 
there are 4 task processors, etc. Thus the total number of processors, P{n), is 
given by 

0(n3(loglogn)Vlog" n) + 0(4(n/2)3(loglog(n/2))V log"(n/2)) 
-kO(16(n/4)3(loglog(n/4))Vlog^(n/4)) -k ... 

= 0(n^(loglogn)^/log^ n) 

That is, the total cost 0(P(n) log n) is asymptotically equal to that of the se- 
quential algorithm. 

It is well known that the prefix sums for a one-dimensional array of size n can 
be done in O(logn) time with n/logn processors using a PRAM-EREW. See 
[6] for the algorithm. If we apply this algorithm in two dimensions, we can have 
prefix sums in O(logn) time with 0(n^/logn) processors in the same computa- 
tional model. These complexities are absorbed in the above main complexities. 

10 Concluding Remarks 

We showed an asymptotic improvemenet on the time complexity of the all-pairs 
shortest path problem, and paralleized it. The results will have consequences for 




A Faster Algorithm for the All-Pairs Shortest Path Problem 289 



application areas where DMM or the APSP problem is used. As an example, 
we showed that the maximum subarray problem can be solved in the same 
complexity as that of DMM. 

There is a room for improvement of the exponent of the factor log log n in 
the complexities being lowered down in future research. 

Part of this work was done while the author was on leave at Osaka University. 
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Abstract. The Quota Traveling Salesman Problem is a generalization 
of the well known Traveling Salesman Problem. The goal of the traveling 
salesman is, in this case, to reach a given quota of the sales, minimizing 
the amount of time. In this paper we address the on-line version of the 
problem, where requests are given over time. We present algorithms for 
various metric spaces, and analyze their performance in the usual frame- 
work of the competitive analysis. In particular we present a 2-competitive 
algorithm that matches the lower bound for general metric spaces. In the 
case of the half-line metric space, we show that it is helpful not to move 
at full speed, and this approach is also used to derive the best on-line 
polynomial time algorithm known so far for the more general On-Line 
TSP problem (in the homing version). 



1 Introduction 

Let us imagine that a traveling salesman is not forced to visit all cities in a 
single tour but in each city he can sell a certain amount of merchandise and his 
commitment is to reach a given quota of sales, by visiting a sufficient number 
of cities; then he is allowed to return back home. The problem to minimize 
the amount of time in which the traveling salesman fulfills his commitment is 
known as the Quota Traveling Salesman Problem (QTSP for short, see [4,9] for 
a definition of the problem) and it is also called Quorum-Cast problem in [11]. 
Such problem can be seen as a special case of the Prize-Collecting Traveling 
Salesman Problem (PCTSP^, [6]) in which again the salesman has to fulfill a 

* Work of Giorgio Ausiello and Luigi Laura is partially supported by the Future and 
Emerging Technologies programme of the EU under contract number 1ST- 1999- 
14186 (ALCOM-FT); and by “Progetto ALINWEB: Algoritmica per Internet e per 
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^ Note that some authors use to name PCTSP the special case that we have called 
QTSP [5,8]. 
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quota but now nonnegative penalties are associated to the cities and the cost of 
the salesman tour is the sum of the distance travelled and the penalties for the 
non visited cities. QTSP corresponds to the case of PCTSP in which all penalties 
are 0. A special case of QTSP is the case in which the amount of merchandise 
that can be sold in any city is 1; this case coincides with the so called fc-TSP 
problem, i.e. the problem to find the minimum tour which visits k cities among 
the given ones. Moreover this problem is related with the fc-MST problem, that 
is the problem to find the minimum tree which spans k nodes in a graph. Clearly, 
if all weights are equal to 1 and the quota to be achieved is equal to the number 
of cities, the problem corresponds to the classic TSP. 

The QTSP problem and the other problems mentioned above have been thor- 
oughly studied in the past from the point of view of approximation algorithms. 
In particular, for the fc-TSP problem and for the /c-MST problem a polynomial 
time algorithm with an approximation ratio 3 has been shown in [15] while, for 
fc-MST problem, in the non rooted case, an algorithm with ratio 2.5 has been 
shown in [1]. For the general PCTSP problem a polynomial time algorithm with 
polylogarithmic performance guarantee has been given in [4, 5] . 

In this paper we wish to address the on-line version of the QTSP problem, 
named OL-QTSP . On line versions of other routing problems such as the trav- 
eling salesman problem [3], the traveling repairman problem [13, 16, 18], variants 
of the dial-a-ride problem [2, 13], have been studied in the literature in the re- 
cent years. The most updated results regarding these problems can be found 
in [17]. In the on-line version of QTSP we imagine that requests are given over 
time in a metric space and a server (the traveling salesman) has to decide which 
requests to serve and in what order to serve them, without yet knowing the 
whole sequence of requests, with the aim of fulfilling the quota by traveling the 
minimum possible amount of time. As it is common in the evaluation of on-line 
algorithms [14], the performance of the algorithm (cost of serving the requests 
needed to fulfill the quota), is matched against the performance of an optimum 
off-line server, that is a traveling salesman that knows all the requests ahead of 
time and decides which requests to serve and in what order, in order to fulfill 
the assigned quota. Clearly the off-line server cannot serve a request before its 
release time. The ratio between the former and the latter is called competitive 
ratio of the on-line algorithm (see [10]). 

In the rest of the paper we first provide a formal definition of the problem 
and we introduce the corresponding notation. Subsequently, in Section 2 we 
provide the proof that no algorithm for the OL-QTSP problem can achieve a 
competitive ratio smaller than 2 in a general metric space, and we also give a 
simple 2-competitive (hence optimal) on-line algorithm. In Section 3 we discuss 
the case in which the metric space is the halfline. In such case we introduce an 
algorithm which achieves a competitive ratio 3/2 by not moving at full speed. 
We also show a matching lower bound. In Section 4 we apply the same ‘slow is 
good’ paradigm used in Section 3, to the design of an on-line algorithm for the 
OL-TSP and in such way we derive the best on-line polynomial algorithm known 
so far, whose competitive ratio is 2.78. In the Conclusion section we illustrate 
some possible developments and open problems. 
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1.1 Statement of the Problem 

Let us define the OL-QTSP problem in a formal way. Let M be a general metric 
space, let us denote with O the origin of M, and let ri . . . be an ordered 
sequence of requests in M, any request being a triple {ti,pi,Wi) where ti is 
the release time of the request, Pi is a point in the metric space where the request 
is released and Wi is the weight of the request. When we refer to the non- weighted 
problem a request is simply a couple (ti,pi). Note that the sequence is ordered 
in the sense that if i < j then ti < tj . The OL-QTSP problem is the problem in 
which an on-line server (traveling salesman) is supposed to serve at least a quota 
Q out of the overall amount of requests come back to the origin 

in the minimum amount of time. In the non- weighted version of the problem 
the quota Q is simply the minimum number of requests to be served. Clearly no 
request can be served before its release time. 

In the following we will denote by OL the On-Line Server, and by OPT 
the server moved by the optimum off line server (adversary). Besides, for any 
sequence of requests cr, by Z^^{a) and Z*{a) we denote, respectively, the com- 
pletion time of OL and OPT over the sequence cr; when there are no ambiguities, 
we write Z^^ and Z* instead of Z^^{a) and Z*{a). 

Furthermore, with p^^{t) and p*{t) we denote respectively the position of 
OL and OPT at time t, and with d{pi,pj) we represent the distance between 
the points Pi and pj in the metric space M . 

2 General Metric Spaces 

2.1 Lower Bound of the Problem 

We first show the following result, that will be extensively used in the rest of 
the paper and referenced as the speed constraint. 

Lemma 1. No p- competitive algorithm, in any metric space, at time t can have 
its server in a position distant from the origin more than t ■ {p — 1). 

Proof. We simply show that any algorithm that violates the constraint has a 
competitive ratio bigger than p. Assume that time t\ is the first instant in which 
the above constraint is not respected, i.e. d{p^^{ti),0) > ti ■ {p — 1). Then, in 
the same time, the adversary releases a set of requests with overall quota Q in 
the origin, and these are served immediately by the adversary (Z* = ti). The 
completion time of the on-line server is not less than > ti-{p—l)+ti = p-t\, 
and this means that the competitive ratio is strictly bigger than p\ > 




Just as the TSP is a special case of QTSP , it is easy to observe that the 
OL-QTSP (in the Homing version, see [3]) is a special case of OL-QTSP . From 
this point of view from [3] we may trivially derive a lower bound of 2 on the 
competitive ratio for OL-QTSP . We now prove a stronger result that provides 
a lower bound for the competitiveness of any on-line algorithm for OL-QTSP in 
a general metric space, even when the quote Q is bounded by a constant. 
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Theorem 1. There is no p- competitive algorithm for the OL-QTSP on the line 
with p < 2, even with a maximum of 3 requests released. 

Proof. Assume the metric space is the real line. The quota Q is equal to 2. 
Denote by A and B the abscissa points -1 and -1-1 on the real line, respectively. 
At moment t = 0, a request r_ is released on A and a request r+ is released 
at B] then, no request is released until moment t = 1. 

Observe first that € [—1,1], but, from Lemma 1, we know that at 

time 1 OL cannot be in —1 or 1 so G (—1, 1). Then, for x G [0, 1], define 

f(x) = min{d(A,p‘^^(l + x)), d(B,p‘^^(l + a;))} and g(x) = f{x) — x. Clearly, g 
is continuous if the server moves in a continuous way. Also, g(0) = /(O) > 0 and 
g(l) = /(I) — 1 < 0, since p^^(2) G (—2,2). Hence, there exists x G [0, 1] such 
that g(x) = 0, i.e., f(x) = x. From now on let xq be the minimum such x. 

Assume first that f{xo) = d{B,p^^{l + a;o)) = xq (this means that p^^{l + 
xo) G {1 — a;o, l+a;o}). Actually, due to Lemma 1, only one of the two possibilities 
holds and hence: p'^^{l + X[j) = 1 — a;o. In this case, if the third (last) request rs 
is released in — 1 -|- a;o at moment t = 1 -I- xo, then the optimum is equal to 2 
(request r_ is served at t = 1, r^ at 1 -bxo, and return to the origin is at time 2). 
On the other hand, since p'^^{l + xq) = 1 — xq, then then the server can serve 
either r+ and rs, or r_ and r^ in time equal to (1 -I- xq) -I- a;o + 1 + 2(1 — xg) = 4. 
In both of the above cases, the OL server cannot guarantee a competitive ratio 
better than 2. 

Assume now f(xo) = d(A,p'^^(l + xg)) = xg. In this case, considering a 
request rg released in 1 — xq at moment t = 1 -l-xo, arguments exactly analogous 
to the previous ones imply a competitive upper bound of 2. The proof of the 
theorem is now complete. □ 

Note that the above lower bound is achieved with Q = 2, the total number of 
requests released is 3 and two requests are released at time 0: only one request 
was released on-line. Obviously, the result of Theorem 1 does not work for the 
case Q = 1 and also not for the case in which Q = 2 and only two requests are 
released. These cases are addressed in Subsection 2.3 

2.2 A 2-Competitive Algorithm 

The following algorithm is 2-competitive for the OL-QTSP problem in any metric 
space: 

Algorithm 1 (Wait and Go (WaG)) For any sequence a already presented 
to the server, with at least a quota Q available over all the requests in a, the 
algorithm computes Z*(a); at time t = Z*{a) the algorithm starts an optimal 
tour that serves the quota and ends back in the origin. 

Since WaG starts its tour at time Z* , it concludes it at time t' < 2 ■ Z* . So 
we can state the following: 

Theorem 2. Algorithm WaG is 2-competitive for the OL-QTSP problem in 
general metric space. 
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Clearly the upper bound provided by the algorithm matches the lower bound 
previously given for general metric spaces and hence no better competitive ratio 
can be established in the general case. 

Note also that, if we deal with the non-weighted problem, where Q is the 
number of requests, if Q is fixed and it is not part of the input the optimal 
solution of the QTSP problem can be computed in polynomial time and, there- 
fore, WaG is a polynomial time algorithm. If Q is part of the input, Tsitsiklis 
[19] proved that, for the off-line problem, the optimal solution can be computed 
in polynomial time if the underlying metric space is the line (or the halfline), 
while, in order to compute Z*(a) for general metric spaces, we have to resort to 
an approximation algorithm for the QTSP . Hence if such algorithm provides a 
solution with approximation ratio c we obtain a 2 • c competitive algorithm. 

2.3 Particular Cases 

In Theorem 1 we proved that, provided that Q = 2 and a minimum of 3 requests 
are released, no algorithm can achieve a competitive ratio better than 2. In this 
section we deal with two particular cases that we obtain when we release the 
constraint on the quota and the number of requests. 

We start with the case when Q = 1, and we prove that, for such case, OL- 
QTSP admits a competitive ratio 3/2. 

The algorithm guaranteeing such a ratio is the following: 

Algorithm 2 (Eventually Replan (ER)) The server waits for the first mo- 
ment to where a request r is released at distance at most to; it goes to serve it at 
full speed; if on the road, another request is released such that the algorithm can 
serve it and return to the origin sooner than for r, it decides to serve this new 
request. 

Theorem 3. ER is i/ 2- competitive for the OL-QTSP with Q = 1. 

Proof. Suppose that an optimal algorithm should serve a request vi released at 
distance Xi from the origin at moment t\; then, Z* = max{ti,xi} xi. 

Assume first Z* = 2x\ {x\ > fi). Then the server moved by ER starts before 
moment t = x\ and returns to the origin (after serving a request) before t = 3a; i. 
So, the competitive ratio is p < 3a;i/2a;i = 3/2. 

Assume now Z* = t\ -\- x\ (a;i < ti). If OL has not yet moved at moment 
t = ti, then it moves in order to serve ri (since 2a;i < 2ti). In this case the 
competitive ratio is p = (t i -I- 2a;i )/ (ti -I- a;i ) <3/2 (recall that we deal with case 

< ti). If the on-line algorithm has started to move before moment t = x\, 
then p < 3a;i/(a;i -I- t\) < 3/2. Finally, assume that OL starts moving at the 
moment t = to, where xi < to < ti, in order to serve a request ro released 
at distance Xq < to- At t = ti, d{0,0L{ti)) < t\ — to. Independently on the 
request, the on-line algorithm has decided to serve at t = ti (this request may or 
may not be ro), it decides to continue its way (guaranteeing 3to), or to serve r\. 
Hence, it can return in the origin before t = min{3to,ti + (ti — to) + 2xi} = 
min{3to, 2Z* - to} < {3f2)Z*. 

So, competitive ratio 3/2 is always guaranteed. □ 
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Now we deal with the case when Q = 2 and only two requests are released. We 
show in the sequel that, also in this case, we can achieve competitive ratio 3/2. 
The algorithm achieving this ratio is the following: 

Algorithm 3 (Serve and Wait (SaW)) The server starts at the first mo- 
ment to such that it can serve a request rg before time 2 • to; once arrived in 
place Vq is released, it waits until the second request is released; it serves it and 
it then returns in the origin. 

Theorem 4. SaW is 3/2-competitive for the OL-QTSP with Q = 2 and a total 
of two requests released. 

Proof. Denote by ri and V 2 the two requests released and assume that they are 
served in this order by an optimal algorithm; assume that this algorithm serves ri 
at instant ti, T 2 at instant T 2 and that it returns in the origin at instant T 3 = Z* . 
We distinguish the two following cases depending on the order followed by the 
server moved by SaW. 

Assume first that OL serves ri before V 2 . It starts before moment t = Ti 
(this moment is the latest departure moment for the server) since it is possible 
to serve ri before moment 2 • ri and this request is already released at this 
moment. Since it aims at r\, it serves it at moment t = t\ -|-(i(0, r\). If request T 2 
is already released at this moment, then the server can serve both r\ and r 2 
and return to the origin before moment t\ d{0, ri) -I- (t 2 — ti) -I- (rg — T2) = 
To d(0,ri) < 3/2 t3 = 3j2Z*. Otherwise, it ends its tour at most at moment 
T2 = d(ri,r2) -I- (t3 - T 2 ) = to + d{ri,r2 ) < 3/2to = 3/2Z*. 

Assume now that the server moved by SaW serves V 2 before r\. Then it starts 
before t = t\ (latest departure moment) and serves V 2 before t = t\ {to — T 2 ) . 
At this moment, r\ is already released; hence server serves it and returns to the 
origin before moment T\ (rg — T 2 ) + (t 2 — n) -I- d{0, ri) < 3/2t3. 

Therefore, in both of the above cases, competitive ratio achieved is bounded 
above by 3/2 and the proof of the proposition is complete. □ 

3 OL-QTSP on the Halfline 

In this section we consider the case in which the metric space is the half-line. For 
such case we are able to show a competitiveness lower bound and a matching 
upper bound. As we said before the off-line version of this problem is solvable 
in polynomial time [19]. 

Theorem 5. For the OL-QTSP problem, when the metric space is the halfline, 
for any Q > 1, there are no p-competitive algorithms where p < 3/2. 

Proof. We start the proof for the case in which Q = 1. 

At time 1 one request in point 1 is released. Due to the speed constraint 
(Lemma 1), the server moved by the on-line algorithm cannot reach the request 
before time t = 2, otherwise the adversary might release immediately a request 
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in the origin. So the on-line server cannot be back in the origin before time t = 3, 
while optimal completion time is 2. 

For the case in which Q > it is sufficient to use the above sequence in 
which we release Q requests at the same time in one point. □ 

We now present a 3/2-competitive algorithm that matches the above lower 
bound for the problem. 

Algorithm 4 (Slow Walk (SW)) The on-line server OL moves from the ori- 
gin at half speed until the first time to in which it is possible i) to serve a quota 
Q over the weights of the requests and ii) come hack to the origin in no more 
than to/2; then it goes hack to the origin at full speed. 

Theorem 6. For the weighted OL-QTSP problem in the halfline, and for any 
value of the quota Q, the algorithm SW is p-competitive with p = 3/2. 

Proof. Basically, we have to prove that the on-line algorithm proceeds in such a 
way to conclude its tour not later than time to -\- to /2 while the adversary uses at 
least time to • Let us first observe that if we consider to = Z* all requests served 
by the adversary in an optimal solution have to lie between O and to/ 2 and have 
been released before time to- Therefore, at time to = Z*, SW can turn back 
because both requirements are met and it can conclude its tour by serving all 
requests on its way to the origin. The overall time it uses is to + ^o/2 = i/2Z* □ 

4 Application to the General OL-TSP Problem 

In the previous section we have observed that the server can gain in competi- 
tiveness by not moving too fast. Other results based on the same approach can 
be found in [17], where the notion of “non zealous” algorithm is defined, as op- 
posed to the notion of “zealous” algorithm [7] that is, intuitively, an algorithm 
that never remains idle when there is work to do. In this section we consider the 
different problem of the general On-Line TSP, in the homing version (H-OLTSP 
[3]) that is the version in which the server has to conclude its tour in the ori- 
gin. By applying the same “slow is good” paradigm we show that, also in the 
H-OLTSP case, the “non-zealousness” of the server can help him in achieving a 
better competitiveness ratio (see also [17]). 

We consider a metric space M and denote by Z\-TSP the metric TSP. In 
H-OLTSP each instance of requests denotes the on-line version; each instance of 
requests is a sequence {pi,ti)i>o and the sequence (U) increases. Given such an 
instance, the traveler has to visit every point Pi (after tf) in the least possible 
time, given the fact that, at each time, he only knows the already revealed points. 
The following on-line algorithm, named Wait and Slowly Move (WaSM), 
uses a polynomial time r-approximation algorithm A for Z\-TSP. 

Without loss of generality one can suppose that, for every Z\-TSP instance 
(pi, . . . ,pn) we have: 



yt,d{0,p*{t)) < max(d(0,pi)) 

i 
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Algorithm 5 (Wait and Slowly Move (WaSM)) 

1) B ^ (3-byT7)/2 

2) as a new request is presented at time to, then: 

the on-line traveler comes back to the origin; 
he waits until time to(l + ^/B); 

he then follows the solution performed by A with speed v{t) such that: 
if d{p^^{t),0) > t/B then v(t) = 0 (ball-constraint) 
else v{t) = 1; 

Note that the ball-constraint means that, at every time t, the traveler is not 
allowed to be further than t/B from the origin. 

Theorem 7. If A — TSP admits a polynomial time r- approximation algorithm, 
then WaSM is a {r -\- c)- competitive algorithm for the H — OLTSP, where c = 
(1-b VT7)/4~ 1.2808. 

Proof Observe that the value B = {3 -\- y/\l)/2 satisfies {B — l)/2 =1-1- 1/B. 
Let us first note that, as the on-line traveler is constrained always to remain at 
a distance at most t/B from the origin (ball-constraint), then, whenever a new 
request is presented at time to , he can be guaranteed to be at the origin at time 

^o(l + 1/-B)- 

Let then t be the time the last request is presented. At time t(l -I- 1/B) the 
on-line traveler is in the origin and begins to follow the solution provided by A. 
Let us consider the two cases in which the on-line traveler has to stop after time 
t{l -\- 1/B) (because of the ball-constraint) or not. 

Case 1: 3ti > 0,p^^{t{l -\-l/B) -\- tf) = [t(l -I- 1/B) -\- ti]/B. Let us denote 
by t' be the maximum such time, which means that p‘^'^(t(l -\- 1/ B) -\- t') = 
[t{l-\- 1/ B) -\-t']/ B and that Vr > t' ,p^^(t{l -\-1 /B)-\-t) = [t(l + 1/B) -\-t\/B 
(from then on, the on-line traveler does not stop until the end of the tour). 

Note that the distance covered between t{l -\- 1/B) and t{l -\- 1 / B) -\- 1' is at 
least [t{l 3- 1/B) -\- t']/B, consequently 

ZOL <t(^i + i/B)+t' + r- Z* - [t{l + 1/B) + t']/B (1) 

On the other side the furthest request being at distance at least [t(l + 1/B) 3- 
t'\/B from the origin, we have: 

Z* >2[t{l + l/B) + t']/B (2) 

From relations 1 and 2 we deduce that, in this case, the related competitive ratio 
is no more than r -\- {B — l)/2. 

Case 2: Vti > Q,p^^(t{l + 1/B) + t{) < [t(l -b 1/B) + t{\/B. Then, 

Z^^ < t{l + l/B) + rZ* 

and 

Z* > t 



Consequently a competitive ratio of (1 -b l/B-br) = (S — l)/2-br is guaranteed, 
which concludes the proof. □ 
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As a consequence of this theorem, if we use Christofides heuristic [12], that 
provides an approximation factor of r = 1.5, we can easily state the following 
result. 

Corollary 1. H-OLTSP admits a 2.7 S- competitive polynomial time algorithm. 
This result improves the 3-competitive result for this problem proved in [3] . 

5 Conclusions 

In this paper the Quota Traveling Salesman Problem has been tackled from the 
on-line point of view. First we have shown that for general metric spaces the 
problem can be solved by means of a 2-competitive algorithm while no better 
competitive algorithm is possible. In particular since the lower bound result is 
proved on the real line no better on-line algorithm can be expected even for such 
particular case. On the contrary, if we consider the half- line metric space we 
showed a 3/2 competitive algorithm and a matching lower bound. A peculiarity 
of the OL-QTSP problem is that no zealous algorithm can achieve the best 
competitive performance. 

Although for the classical Homing OL-TSP exponential time zealous algo- 
rithms have been shown to reach the best possible competitive ratio, we show a 
non zealous polynomial time algorithm which achieves a 2.78 competitive ratio, 
outperforming the best known polynomial time algorithm for the same problem. 

The problem to find a polynomial time algorithm for OL-QTSP with a good 
competitive ratio is still open. 
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Abstract. In this paper we show that an outerplanar graph G with 
maximum degree at most 3 has a 2-D orthogonal drawing with no bends 
if and only if G contains no triangles. We also show that an outerplanar 
graph G with maximum degree at most 6 has a 3-D orthogonal drawing 
with no bends if and only if G contains no triangles. 



1 Introduction 

We consider the problem of generating orthogonal drawings of outerplanar graphs 
in the plane and space. The problem has obvious applications in the design of 
2-D and 3-D VLSI circuits and optoelectronic integrated systems. 

Throughout this paper, we consider simple connected graphs G with vertex 
set V{G) and edge set E{G). We denote by (1g{v) the degree of a vertex v in 
G, and by A{G) the maximum degree of vertices of G. G is called a fc-graph 
if 2\(G) < k. The connectivity of a graph is the minimum number of vertices 
whose removal results in a disconnected graph or a single vertex graph. A graph 
is said to be /c-connected if the connectivity of the graph is at least k. 

It is well-known that every graph can be drawn in the space so that its edges 
intersect only at their ends. Such a drawing of a graph G is called a 3-D drawing 
of G. A graph is said to be planar if it can be drawn in the plane so that its 
edges intersect only at their ends. Such a drawing of a planar graph G is called 
a 2-D drawing of G. 

A 2-D orthogonal drawing of a planar graph G is a 2-D drawing of G such 
that each edge is drawn by a sequence of contiguous horizontal and vertical line 
segments. A 3-D orthogonal drawing of a graph G is a 3-D drawing of G such that 
each edge is drawn by a sequence of contiguous axis-parallel line segments. Notice 
that a graph G has a 2-D [3-D] orthogonal drawing only if A(G) < 4[Z\(G) < 6]. 
An orthogonal drawing with no more than b bends per edge is called a 6-bend 
orthogonal drawing. 

Biedl and Kant [2], and Liu, Morgana, and Simeone [7] showed that every 
planar 4-graph has a 2-bend 2-D orthogonal drawing with the only exception of 
the octahedron, which has a 3-bend 2-D orthogonal drawing. Moreover, Kant [6] 
showed that every planar 3-graph has a 1-bend 2-D orthogonal drawing with the 
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only exception of K/i. On the other hand, Garg and Tamassia proved that it is 
NP-complete to decide if a given planar 4-graph has a 0-bend 2-D orthogonal 
drawing [5]. Battista, Liotta, and Vargiu showed that the problem can be solved 
in polynomial time for planar 3-graphs and series-parallel graphs [1]. 

We show in Section 3 that an outerplanar 3-graph G has a 0-bend 2-D or- 
thogonal drawing if and only if G contains no triangle as a subgraph. 

Eades, Symvonis, and Whitesides [4], and Papakostas and Tollis [8] showed 
that every 6-graph has a 3-bend 3-D orthogonal drawing. Moreover, Wood 
showed that every 5-graph has a 2-bend 3-D orthogonal drawing [10]. On the 
other hand, Eades, Stirk, and Whitesides proved that it is NP-complete to decide 
if a given 5-graph has a 0-bend 3-D orthogonal drawing [3] . 

We show in Section 4 that an outerplanar 6-graph G has a 0-bend 3-D or- 
thogonal drawing if and only if G contains no triangle as a subgraph. 

It is interesting to note that a complete bipartite graph ^^ 2 , 3 ) which is a 
minimal non-outerplanar graph with no triangles, has no 0-bend 2-D or 3-D 
orthogonal drawing. 

2 Preliminaries 

A 2-D drawing of a planar graph G is regarded as a graph isomorphic to G, and 
referred to as a plane graph. A plane graph partitions the rest of the plane into 
connected regions. A face is a closure of such a region. The unbounded region is 
referred to as the external face. We denote the boundary of a face / of a plane 
graph r by b{f). If F is 2-connected then b{f) is a cycle of F. 

Given a plane graph F, we can define another graph F* as follows: corre- 
sponding to each face / of T there is a vertex f* of F*, and corresponding to 
each edge e of E there is an edge e* of E*; two vertices f* and g* are joined by 
the edge e* in F* if and only if the edge e in E lies on the common boundary of 
faces / and g of E. E* is called the (geometric-)dual of E. 

A graph is said to be outerplanar if it has a 2-D drawing such that every 
vertex lies on the boundary of the external face. Such a drawing of an outerplanar 
graph is said to be outerplane. Let E be an outerplane graph with the external 
face fo, and E* — f* be a graph obtained from E* by deleting the vertex f* 
together with the edges incident to /*. It is easy to see that if E is an outerplane 
graph then F* — f* is a forest. In particular, an outerplane graph E is 2-connected 
if and only if E* — /* is a tree. 

3 2-D Orthogonal Drawing 

An edge of a plane graph E which is incident to exactly one vertex of a cycle G 
and located outside G is called a leg of G. A cycle G of E is said to be k-legged 
if G has exactly k legs. 

The planar representation E(E) of a plane graph E is the collection of circular 
permutations of the edges incident to each vertex. Plane graphs E and F' are 
said to be equivalent if E(E) is isomorphic to P{F'). 
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The following interesting theorem was proved by Rahman, Naznin, and 
Nishizeki [9]. 

Theorem I A plane 3-graph F has an equivalent 0-bend 2-D orthogonal draw- 
ing if and only if every k-legged cycle in F contains at least 4 — fc vertices of 
degree 2 in F for any k, 0 < k < 3. □ 

We show in this section the following theorem. 

Theorem 1. An outerplanar 3-graph G has a 0-bend 2-D orthogonal drawing if 
and only if G contains no triangle as a subgraph. 

Proof : The necessity is obvious. We show the sufficiency. Let G be an outer- 
planar 3-graph with no triangles, and F be an outerplane graph isomorphic to 
G. We show that F satisfies the condition of Theorem I. 

Lemma 1. If F is 2-connected then the boundary of the external face fo contains 
at least 4 vertices of degree 2 in F . 

Proof of Lemma 1: If T is a cycle then the lemma is obvious. Suppose that 
F has more than one cycle. Since F is 2-connected, F* — f* is a tree. Since 
F contains no triangles, the boundary of a face of F corresponding to a leaf 
of F* — /* contains at least 2 vertices of degree 2 in F, which also lie on the 
boundary of fo- Since a tree has at least 2 leaves, we obtain the lemma. □ 

It is easy to see that every cycle C of T is the boundary of the external face 
of a 2-connected outerplane subgraph of F. Thus, by Lemma 1, C contains at 
least 4 vertices of degree 2 in the subgraph. It follows that if C is a k-legged cycle 
in F then G contains at least d — k vertices of degree 2 in F. This completes the 
proof of the theorem. □ 

It should be noted that there exists an outerplanar 4-graph with no triangles 
that has no 0-bend 2-D orthogonal drawings. Fig. 1 shows such a graph F. F has 
a pentagon and five squares. If F has a 0-bend 2-D orthogonal drawing then the 
pentagon and squares are drawn as rectangles. All the squares must lie outside a 
rectangle R representing the pentagon. This is impossible, however, since there 
exists a pair of consecutive squares which lie to the same side of R. 

4 3-D Orthogonal Drawing 

We show in this section the following theorem. 

Theorem 2. An outerplanar 6-graph G has a 0-bend 3-D orthogonal drawing if 
and only if G contains no triangle as a subgraph. □ 

The necessity is obvious. We will show the sufficiency in the rest of the 
section. 
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Fig. 1. An outerplanar 4-graph F. 



4.1 2-Connected Outerplanar Graphs 

We first consider the case when G is 2-connected. Let G be a 2-connected out- 
erplanar 6-graph with no triangles, and F be an outerplane graph isomorphic to 
G. Since F is 2-connected, y* = _ f* jg ^ tree. A leaf r* of T* is designated 

as a root, and T* is considered as a rooted tree. If g* is a child of f* in T* , f 
is called the parent face of g, and g is called a child face of / in G. The unique 
edge in b{f) n b{g) is called the base of g. The base of r is defined as an edge 
with both ends of degree 2. Let S* be a subtree of T* containing r*. S* is also 
considered as a tree rooted at r*. F{S*) is a subgraph of F induced by the ver- 
tices on boundaries of faces of F corresponding to the vertices of S* . It should 
be noted that F{S*) is a, 2-connected outerplane graph with no triangles. Let f* 
be a vertex of S*, and f*GV (T*) — V (S*) be a child of f* in T*. S'* -|- /* is a 
rooted tree obtained from S* by adding /* and an edge (/*, /*). 

For any face / of F, b{f) is a cycle, since F is 2-connected. Let b{f) = 
{eg, ei, . . . , Cfc-i}, where eg is the base of /, and edges Cj and mod fc) are 
adjacent. A 0-bend 2-D orthogonal drawing of / is said to be canonical if / is 
drawn as a rectangle such that the edges C2, eg, ... , and ek -2 are drawn on a 
side of the rectangle. A drawing of F{S*) is said to be canonical if every face is 
drawn canonically. 

Fig. 2 shows a rooted tree T* for F shown in Fig.l, where r is a square face, 
and a 0-bend 3-D orthogonal canonical drawing of F. 

Roughly speaking, we will show that if F{S*) has a 0-bend 3-D orthogonal 
canonical drawing then F{S* + f*) also has a 0-bend 3-D orthogonal canonical 
drawing. The following theorem immediately follows by induction. 

Theorem 3. A 2-connected outerplanar 6-graph with no triangles has a 0-bend 
3-D orthogonal drawing. □ 



4.1.1 Proof of Theorem 3 For any v G V{F), we define that /„ is a face 
such that V is on 6(/„) and /* is the nearest vertex to r* in T*. We denote by 
Ir{v) the set of edges incident with v in F. 
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Fig. 2. T* for F and a 0-bend 3-D orthogonal canonical drawing of F. 



Let rl(S'*) be a 0-bend 3-D orthogonal canonical drawing of F{S*). We as- 
sume without loss of generality that each vertex of A{S*) is positioned at a 
grid-point in the three-dimensional space. Let <j) : V{F{S*)) y(rl(S'*)) be an 

isomorphism between F{S*) and A{S*). The mapping (f) is called a layout of 
r{S*). If 4>{v) = (vx,Vy,Vz), we denote = 4>x{v),Vy = 4>y{v), and = 4>z(v). 
Notice that A{S*) is uniquely determined by (p. 

Let Bx = (1, 0, 0), By = (0, 1, 0), Bz = (0, 0, 1) and define that T> = {b^, By, e^, 
—Bx, —£y, — 6 z}. For any v € V{r{S*)), /?„ is a one-to-one mapping from Ir{v) 
to T>. If / is a face with the base e = (u,v), and fi is a child face of / with 
the base ci = {u,v'), let { 62 } = b{fi) fl Ir{u) — {ei}- A mapping /3„ is said to 
be admissible if /i does not exist or / 3 „(e 2 ) is orthogonal with both /3„(e) and 
/3„(ei). B{S*) = {f3y I V G V{r{S*))} is called a canonical orientation for A(S'*) 
if the following conditions are satisfied: 

(Bl) For any v G V{r{S*)) and e,e' G if e,e' G 6 (/) for a face / yf /„ of 

r then Pv{e) and j3y{e') are orthogonal. 

(B2) If Co = {u, v) G E{r{S*)) is the base of a face / in F, {ei} = 6(/)n/r(u) — 
{eo}, and {efe_i} = 6 (/) D Ir(v) - {eo} then /3„(ei) = /3„(efc_i). 

(B3) For any e = (u,v) G E{F{S*)), (j>{u) = cpiy) + mf3y{e) and (piy) = (p{u) + 
m(3u{e) for some integer m. 

(B4) For each face with base (u,v), /3„ or j3y is admissible. 

We prove the theorem by induction. The basis of the induction is stated in 
the following lemma, whose proof is obvious. 

Lemma 2. E{r*) has a 0-bend 3-D orthogonal drawing with a canonical orien- 
tation. □ 

Let f* be a vertex of S* with a child f* G V(T*) — V{S*). 

Lemma 3. If E{S*) has a 0-bend 3-D orthogonal canonical drawing with a 
canonical orientation then r{S*-\- ff) also has a 0-bend 3-D orthogonal canonical 
drawing with a canonical orientation. 

Proof of Lemma 3: Let A(S'*) be a 0-bend 3-D orthogonal canonical drawing 
of r{S*) with a canonical orientation B{S*) = {/3„ | v G y(r(S'*))} , and (p be 
the layout of r{S*). 

Let b{fc) = {eo = {vo,Vk-i),ci = {vo,vi), . . . ,Ck-i = {vk- 2 ,Vk-i)} , where 
Co is the base of /c. We assume without loss of generality that (ivoi^o) = 
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and /3«o(ei) = By. Now we define a layout (j)' of r{S* + /*). For the vertices 
wi, U 2 , • • • , Wfe -3 on 5(/c), we define that (f)' {vi) = (p{vo)+ey + {i — l)ex, I < i < k— 
3. We also define that 4>'{vo) = 4>{vo), = max.{(j)x{vk-i) , (j>x{vo)+k — 3}, 

and 4>'{vk-2) = 4‘'{'Vk-i) + By. Let I = 4>'^{vk-i) + k — 4 >x{vq). For each vertex 
V G V{r{S*)) — {fo, ffe-i, ffc- 2 } ) we define that 



(t>'xiv) 

W 

€iv) 



f ^x(v) 

\ (/>x(v) + I 

f <Py(v) 

{(pyiv) + 1 






if (px{v) < (l)x{vo), 
if <l)x{v) > (l)x{vo), 

if (j)y{v) < 4>y{vo), 
if (l)y{v) > (f>y{vo), 



Since P{S*) satisfies (Bl), (B2), and (B3), (j)' is well-defined and induces a 
0-bend 3-D orthogonal drawing 2l(S'* -|- /*) of r{S* + f^), as easily seen. 

It remains to show a canonical orientation B{S* + f*) for A{S* + f*). Let f* 
be the children of /* in T* such that G b{fc) is the base of /i, 1 < f < fc — 1, 
if any. We first consider a mapping a from the children of f* to {0, 1, —1}. 

Claim 1. A partial mapping a on {f\, fk-i\with |a(/i)| -I- |a(/fc-i)| yf 0 can 
he extended to a mapping such that 

(Al) a{fi) yf a(/j+i) fori = l,k- 2, 

(A2) |a(/j) - a{fi+i) \ = 1 for 2 < i < k - 3. 

Proof of Claim 1: It suffices to consider the following cases by symmetry. 

Case 1 a(/i) < 0 and a{fk-i) = — 1 : We define that 

. ^ _ J 1 if z is even, 
o(/ij Q otherwise. 

Case 2 a(/i) = 1 and o;(/fe_i) = — 1 : We define that 

i f \ _ j 0 if z is even, 

'af/ij otherwise. 



It is easy to see that a defined above is a desired mapping. □ 

B{S* + ff) will be defined as an extension of B{S*), that is, B{S*) C B{S* + ff). 
It suffices to define f3y^ (1 < z < fc — 2) for the vertices zzi, U 2 , . . . , Vk -2 on 6(/c). 
Let {e[} = Ir(vo) n 6(/i) - &(/c) and = /r(ffc-i) n b{fk-i) - 6(/c), if 

any. We define a partial mapping a as: 



a(/i) 



1 if (ivoie'i) = fivoieo) x fivoiei), 

-1 if Pvoie'i) = -fdvoieo) x /3„o(ei), 

0 otherwise. 



and 



a(/fc-i) 



1 if — Pvk-ii^o) ^ Pvk-ii^k-i)} 

— 1 Pvk-ii^k-i) ~ ~Pvk-ii^o) X (cfe-i), 

0 otherwise. 
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where x denotes the exterior product of verctors. Since B{S*) satisfies (B4), we 
have |a(/i)| + |a(/fe_i)| 0. So let a be a mapping satisfying the conditions of 

Claim 1. 

For each vertex Vi (1 < i < k — 2), we label the edges in Ir{vi) as follows. Let 
ei]^ = e-i- If e is the base of a child face of face fi, let = e. In general, 
if e is the base of a child face of face let = e, if any. If 

has no such child face and {e} = fl Ir(vi) — then we defined that 

= e. Let = e^+i. If e is the base of a child face of face fi+i, let 
= e. In general, if e is the base of a child face of /i+i> f®t = e, 

if any. If has no such child face and {e} = fl Ir(vi) — {ei'’?}, then 

we defined that = e. 

We first define for eif , and if any: 




if i = 1, 

if 2 < i < k — 2, 

if 1 < z < A: — 3, 
if z = A: — 2, 

if a{fi) = 0 and z = 1, 
if a{fi) = 0 and 2 < i < k — 2, 
if a(/*) = 1, 
if a(/*) = -1, 

if a{fi) = 0 and 1 < z < A; — 3, 
if a{fi) = 0 and i = k — 2, 
if ctlfi) = 1, 
if a(/0 = -1- 



We next define for and if any. We define that: 






-e^ if a(/i) = ±1, 

if a(/i) = 0 and a(/ 2 ) = -1, 
-e^ if a(/i) = 0 and a(/ 2 ) = 1, 

ey if a(/2) = ±1, 

if a(/ 2 ) = 0 and a(/i) = -1, 
-e^ if a(/ 2 ) = 0 and a(/i) = 1. 



if any. It should be noted that o;(/i) A o(/ 2 ), since a satisfies (Al). For 2 < z < 
k — 2>, we define that: 

), 

if any. We define that: 

( By if a{fk-2) = ±1, 

/3«fe-2(4?_2) = S if a(/fc- 2 ) = 0 and a(/fc-i) = -1, 

[ -e^ if a{fk- 2 ) = 0 and a{fk-i) = 1, 
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( if a{fk-i) = ±1, 

/3«fc-2(eill2) = S if a(/fc-i) = 0 and a{fk- 2 ) = -1, 

[ -e^ if a(/fc_i) = 0 and a(/fc_ 2 ) = 1, 

if any. It should be noted that a{fk- 2 ) ^ ct{fk-i), since a satisfies (Al). 

Since a satisfies (Al) and (A2), it is not difficult to verify that B{S* + /*) 
defined so far satisfies the conditions (Bl) through (B4), and is a canonical ori- 
entation for A(S'* -I- /*). □ 



4.2 General Outerplanar Graphs 

We next consider the general case when G is a connected outerplanar 6 -graph 
with no triangles, and complete the proof of Theorem 2. 

For graphs Gi and G 2 , Gi U G 2 is a graph defined as V (Gi U G 2 ) = V (Gi) U 
V (G 2 ) and E{Gi U G 2 ) = E{G\) U E{G2)- A subset 5 of I? is said to be suitable 
if vectors vci T> — S can be linearly arranged such that adjacent vectors are 
orthogonal. Notice that S is not suitable if and only \i T> — S = {ea,—ea} 
for some a G {x,y,z}. Let E be an outerplane graph isomorhpic to G, A be 
a 0-bend 3-D orthogonal canonical drawing of T with a canonical orientation 
B = {Pv\v G y(r)}, and Epiv) = {/3„(e)|e G Ir(v)}. A is said to be suitable if 
Er{v) is suitable for every vertex v G V{E). 

In order to complete the proof of Theorem 2, it is sufficient to show the 
following. 

Lemma 4. Let Ei be an outerplane graph without triangles, and A\ he a suitable 
0-hend 3-D orthogonal canonical drawing of Ei with a canonical orientation B = 
{I3v\v G y(Ti)} and a layout (jA . Let I 2 he a 2-connected outerplane graph 
without triangles or a graph consisting of an edge such that |V^(A) ny(/ 2 )| = 1 
and T 1 U /2 is a 6-graph. Then, T 1 UI 2 also has a suitable 0-hend 3-D orthogonal 
canonical drawing. 

Proof of Lemma 4: Let {w} = V (A) n ^(A) and Er{w) = {ai, 02 , . . . , Og} 
such that ai and are orthogonal (s < 6 ). We assume without loss of gener- 
ality that ai = Ex and a 2 = By. We distinguish two cases. 

Case 1 A is an edge: Let A(A) = {(w,w')}. Then, we define a layout (f)' of 
A U A as follows: 

"A (j>x{v) <(j>x{w), 

\ (v) -\- Ex A (j)x{v) > 4>x{w) -\- 1, 

= (jA{w) -G Ex. 

If we define f}w{w,w') = Ex, it is easy to see that (j)' induces a suitable 0-bend 
3-D orthogonal canonical drawing of A U A with a canonical orientation. 

Case 2 A is 2-connected: Let rj b® ^ of Tf such that w is on 6 (r 2 ), and 
b{i^2) = {{w,vi),{vi,V2), ■ ■ ■ ,{vk-2,Vk-i),{vk-i,w)}. Let e be the base of the 
unique child face of C 2 . We show a layout of A U A for the case of e = {w, wi). 
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Layouts of A U A for other cases can be obtained by similar ways. We define a 
layout (j)' of A U A(r 2 ) as follows: for each u € f^(A), 



C(w) 

(jj'yiu) 



{(j)l{u) + k-5 if (j)l{u) > (j)l{w) + 1, 

J 'Pl(u) if ^y(u) < 

I <Py(u) + 1 if <PI(u) > <f>l(w) + 1, 



and, for each Vi G f^(A(A)), 

,,, , f (/>^(w) + By + (i - l)e^ l<i<k-2, 

<■<»■) = 1 0i(„) + (fc-3K , = 



If we define /3m (e) = and /3m(w,ui) = e^, it is easy to verify that 4>' induces 
a 0-bend 3-D orthogonal canonical drawing rli U rl(r 2 ) of A U A(r5) with a 
canonical orientation. 

Since fF A^ijA(r'){w) is suitable for rli U ^(A), we can produce a suitable 0- 
bend 3-D orthogonal canonical drawing of A U A with a canonical orientation 
by Lemma 3. □ 



Since it takes 0{n) time to draw a face, and the number of faces is 0{n) for 
an n-vertex outerplanar graph, a 0-bend 3-D orthogonal drawing of an n-vertex 
outerplanar graph can be obtained in 0{n^) time. 
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Abstract. A canonical decomposition, a realizer, a Schnyder labeling 
and an orderly spanning tree of a plane graph play an important role in 
straight-line grid drawings, convex grid drawings, floor-plannings, graph 
encoding, etc. It is known that the triconnectivity is a sufficient condition 
for their existence, but no necessary and sufficient condition has been 
known. In this paper, we present a necessary and sufficient condition 
for their existence, and show that a canonical decomposition, a realizer, 
a Schnyder labeling, an orderly spanning tree, and an outer triangular 
convex grid drawing are notions equivalent with each other. We also show 
that they can be found in linear time whenever a plane graph satisfies 
the condition. 



1 Introduction 

Recently automatic aesthetic drawing of graphs has created intense interest due 
to their broad applications, and as a consequence, a number of drawing methods 
have come out [3-9,11-13,15,18,21,23]. The most typical drawing of a plane 
graph G is the straight line drawing in which all vertices of G are drawn as points 
and all edges are drawn as straight line segments without any edge-intersection. 
A straight line drawing of G is called a grid drawing if the vertices of G are put 
on grid points of integer coordinates. A straight line drawing of G is called a 
convex drawing if every face boundary is drawn as a convex polygon [4,22,23]. 
A convex drawing of G is called an outer triangular convex drawing if the outer 
face boundary is drawn as a triangle, as illustrated in Fig. 1. 

A canonical decomposition, a realizer, a Schnyder labeling and an orderly 
spanning tree of a plane graph G play an important role in straight-line draw- 
ings, convex grid drawings, floor-plannings, graph encoding, etc. [1,2,5-8,12, 
13,15-18,21]. It is known that the triconnectivity is a sufficient condition for 
their existence in G [5, 10, 12,21], but no necessary and sufficient condition has 
been known. In this paper, we present a necessary and sufficient condition for 
their existence, and show that a canonical decomposition, a realizer, a Schnyder 
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Fig. 1. An outer triangular convex drawing of a plane graph. 

labeling, an orderly spanning tree, and an outer triangular convex grid drawing 
are notions equivalent with each other. We also show that they can be found in 
linear time whenever G satisfies the condition. Algorithms for finding them have 
only been available for the triconnected case. 

2 Main Theorem and Definitions 

Let G be a plane biconnected simple graph. We assume for simplicity that the 
degrees of all vertices are larger than or equal to three, since the two edges 
adjacent to a vertex of degree two are often drawn on a straight line. Our main 
result is the following theorem; some terms will be defined later. 

Theorem 1. Let vi, V 2 and vs be vertices on the outer face Fo{G) of G appear- 
ing counterclockwise in this order, as illustrated in Fig. 2. Let Pi be the path 
from vi to V 2 on Fq(G), let P 2 be the path from V 2 to V 3 , and let P 3 be the path 
from V3 to vi- Then the following six propositions (a)-(f) are equivalent with 
each other. 

(a) G has a canonical decomposition with respect to vi,V 2 and V 3 . 

(b) G has a realizer with respect to vi,V 2 and V 3 . 

(c) G has a Schnyder labeling with respect to Vi,V 2 and V 3 . 

(d) G has an outer triangular convex grid drawing such that T’o(G) is drawn as 
a triangle V1V2V3. 

(e) G is internally triconnected, and has no separation pair {u,v} such that 
both u and v are on the same path Pi, 1 < i < 3. (See Figs. 1,2, 3,4.) 

(f) G has an orderly spanning tree such that V 3 is the root, vi is the minimum 
leaf, and V2 is the maximum leaf. 

It is known that (a)=^(b), (b)<t^(c) and (c)=^(d) [10,12,21]. Furthermore, 
(d)=>(e) holds as shown later in Lemma 1. In this paper, we complete a proof of 
Theorem 1 by proving (e)=^(a) in Section 3 and (b)<t^(f) in Section 4. 

In the remainder of this section, we present some definitions and known 
lemmas. 

We denote by G = (F, A) an undirected simple graph with vertex set V and 
edge set E. Let n be the number of vertices of G. An undirected edge joining 
vertices u and v is denoted by (u,v). We denote by (u,v) a directed edge from 
u to V. 
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Fig. 2. Illustration for the necessary Fig. 3. A separation pair {u, u} on P3. 

and sufficient condition. 



A graph is planar if it can be embedded in the plane so that no two edges 
intersect geometrically except at a vertex to which they are both incident. A 
plane graph is a planar graph with a fixed embedding. A plane graph divides the 
plane into connected regions called faces. We denote by Fo{G) the outer face of 
G. The boundary of Fo{G) is also denoted by Fo{G). A vertex on Fo{G) is called 
an outer vertex^ while a vertex not on F^iG) is called an inner vertex. An edge 
on Fo{G) is called an outer edge, while an edge not on Fq{G) is called an inner 
edge. 

We call a vertex u of G a cut vertex if its removal from G results in a dis- 
connected graph. A graph G is biconnected if G has no cut vertex. We call a 
pair {u,v} of vertices in a biconnected graph G a separation pair if its removal 
from G results in a disconnected graph, that is, G — {u,v} is not connected. A 
biconnected graph G is triconnected if G has no separation pair. A plane bicon- 
nected graph G is internally triconnected if, for any separation pair {u,v} of G, 
both u and v are outer vertices and each connected component of G — {u, u} 
contains an outer vertex. In other words, G is internally triconnected if and only 
if it can be extended to a triconnected graph by adding a vertex in an outer 
face and joining it to all outer vertices. An internally triconnected plane graph 
is depicted in Fig. 1, where all the vertices of separation pairs are drawn as white 
circles. If a biconnected plane graph G is not internally triconnected, then G has 
a separation pair {u,v} as illustrated in Figs. 4(a)-(c) and a “split component” 
F[ contains a vertex other than u and v. 

We now have the following lemma. 

Lemma 1. (d)=f>(e). 

Proof. Suppose that a biconnected plane graph G is not internally triconnected. 
Then G has a separation pair {u,v} as illustrated in Figs. 4(a)-(c), and a “split 
component” FI has a vertex w other than u and v. The degree of w is larger 
than or equals to three by the assumption of this paper, and hence the two faces 
marked by x cannot be simultaneously drawn as convex polygons. Thus G has 
no outer triangular convex drawing. 

Suppose that G has a separation pair {u,v} such that both u and v are 
on Pi, 1 < i < 3, as illustrated in Fig. 3. Then G has no outer triangular 
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(a) (b) (c) 

Fig. 4. Biconnected plane graphs which are not internally triconnected. 



convex drawing, because Pi must be drawn as a straight line segment, the split 
component H has a vertex of degree three or more other than u and v, and hence 
the face marked by x in Fig. 3 cannot be drawn as a convex polygon. 

The two facts above immediately implies (d)=J>(e). □ 



We then define a canonical decomposition [5]. Let V = {ui,U2, • • • , Un}- Let 
ui,U 2 and be three outer vertices appearing counterclockwise on Fo{G) in 
this order. We may assume that ui and U 2 are consecutive on Fo(G); otherwise, 
let G be the graph obtained by adding a virtual edge (ui,U 2 ) to the original 
graph. Let U = (Vi, V2, • • • , 14) be an ordered partition of V into nonempty 
subsets V\,V 2 , - ■ ■ ,Vh- Then 14 U V2 IJ • ■ ■ IJ 14 = V and 14 H ^ ® 

indices i and j, 1 < i < j < h. We denote by Gk, 1 < k < h, the subgraph 
of G induced by 14 U ^2 U ’ ' ' U by Gk the subgraph of G induced by 

14+1 U 14+2 U ■ ■ ■ U Note that G = Gh- We say that 7J is a canonical decom- 
position of G (with respect to U\,U 2 and u„) if the following three conditions 
(cdl)-(cd3) hold: 

(cdl) 14 consists of all the vertices on the boundary of the inner face containing 
the outer edge (^1,^2), and I4 = {««}• 

(cd2) For each index k, 1 < k < h, Gk is internally triconnected. 

(cd3) For each index k, 2 < k < h, all the vertices in I4 are outer vertices of 
Gk, and 

(a) if |14| = 1, then the vertex w in I4 has two or more neighbors in Gk-i 
and has at least one neighbor in Gk when k < h; and 

(b) if |14| > 2, then the vertices in I4 consecutively appear on Fo{Gk), 
each of the first and last vertices in 14 has exactly one neighbor in 
Gk-i, and all the other intermediate vertices in I4 have no neighbor 
in Gk-i, and each vertex in I4 has at least one neighbor in Gk- 

A canonical decomposition of the graph in Fig. 1 is illustrated in Fig. 5. 
Although the definition of a canonical decomposition above is slightly different 
from one in [5], they are effectively equivalent with each other. The following 
lemma is known. 

Lemma 2. [5] Every triconnected plane graph G has a canonical decomposition. 

The definition of a realizer [10] is omitted in this extended abstract, due to 
the page limitation. The following lemma is known on a realizer and a canonical 
decomposition . 
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Fig. 5. A canonical decomposition II = (Vi, V2, • ■ • , V 15 ) of the graph G in Fig. 1. 



Lemma 3. [10] If a plane graph G has a canonical decomposition with respect 
to Ui,U 2 and then G has a realizer with respect to U\,U 2 and 

The definition of a Schnyder labeling [12,21] is omitted in this extended 
abstract, due to the page limitation. The following three lemmas are known on 
the Schnyder labeling. 

Lemma 4. [12,21] Every triconnected plane graph G has a Schnyder labeling, 
and it can be computed in linear time. 

Lemma 5. [12] A plane graph G has a realizer if and only if G has a Schnyder 
labeling. 

Lemma 6. [12] If a plane graph G has a Schnyder labeling with respect to ai, 02 
and 03 , then G has an outer triangular convex grid drawing such that Fa{G) is 
drawn as a triangle 010203 and the size of grid is (n — 1 ) x (n — 1 ). 

We then define an orderly spanning tree [1, 2, 17]. Let T be a spanning tree of 
G rooted at an outer vertex ui of G. Let ui, U 2 , • • • , rin be the counterclockwise 
preordering of vertices in T as illustrated in Fig. 6 , where T is drawn by thick 
lines and each vertex Ui, 1 < i < n, is attached an index i. We call the leaf Ua 
of T having the maximum index the maximum leaf of T. Clearly a = n. We 
call the leaf up having the minimum index the minimum leaf of T. We say that 
two distinct vertices of G are unrelated if any of them is not an ancestor of the 
other in T. Let N{ui) be the set of all the neighbors of Ui in G. The set N{ui) 
is partitioned into the following four subsets Ni{ui) , N 2 {ui) , N 3 {ui) and N 4 {ui): 

Ni{ui) = {uj G N{ui) I Uj is the parent of Ui in T}, 

N 2 {ui) = {uj € N{ui) I j < i, Uj is not the parent of Ui\, 

Nsfu-i) = {uj € N{ui) I Uj is a child of Ui in T}, and 

Ni{ui) = {uj G N{ui) I j > i, Uj is not a child of Ui in T}. 
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Note that Ni{ui) = 0, |-/Vi(ui)| = 1 for each vertex m, 2 < i < n, and |iV 3 (ui)| = 
0 for each leaf Ui of T. We call T an orderly spanning tree of G if the following 
conditions (ostl) and (ost2) hold: 

(ostl) For each vertex Ui, 1 < i < n, Ui and any vertex uj € N 2 {ui) IJ N 4 {ui) are 
unrelated, and the vertices in Ni{ui), N 2 {ui), N 3 {ui) and N 4 {ui) appear 
around Ui counterclockwise in this order, as illustrated in Fig. 7; and 
(ost2) For each leaf Ui of T other than Ua and up, N 2 {ui), N 4 {ui) yf 0. 

Chiang et al. [2] define an orderly spanning tree only for maximal plane 
graphs, and there is no Condition (ost2) in their definition. They show that a 
maximal plane graph G has an orderly spanning tree if and only if G has a 
realizer [2] . We add Condition (ost2) since G is not necessarily a maximal plane 
graph in this paper. 

We have (a)=J>(b) by Lemma 3, (b)<^(c) by Lemma 5, (c)=^(d) by Lemma 6, 
and (d)=>(e) by Lemma 1. In order to prove Theorem 1, it suffices to prove 
(e)=>(a) and (b)<^(f). 




Fig. 6. An orderly spanning tree of the Fig. 7. Four sets Ni{ui), N 2 {ui), 

graph G in Fig. 1. N 3 {ui) and N 4 {ui). 



3 Proof of (e)=^(a) 

In this section, we give a proof of (e)=>(a). We first define some terms. 

If an internally triconnected plane graph G is not triconnected, then G has 
a separation pair of outer vertices and hence has a “chord path” defined below 
when G is not a single cycle. 

Let G be a biconnected plane graph, and let W\,W 2 , ■ ■ • ,wt be the vertices 
appearing on Fq{G) clockwise in this order. We call a path Q in G a chord-path 
if Q satisfies the following (i)~(iv): 

(i) Q connects two outer vertices Wp and Wq, p < q; 

(ii) {wp,Wq} is a separation pair of G; 

(iii) Q lies on an inner face; and 
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(iv) Q does not pass through any outer edge and any outer vertex other than 
the ends Wp and Wq. 

A chord-path Q connecting Wp and Wq is minimal if none of Wp+i , Wp+ 2 , ■ ■ ■ , Wq-i 
is an end of a chord-path. Thus the definition of a minimal chord-path depends 
on which vertex is considered as the starting vertex rci of Fo(G). 

Let {x\,X 2 , • • • , Xp}, p > 3, be a set of three or more outer vertices consecutive 
on Fa{G) such that d{xi) > 3, d{x 2 ) = d{x^) = • • • = d{xp-i) = 2, and d{xp) > 
3. Then we call the set {a; 2 , X 3 , • • • , Xp-i} an outer chain of G. 

We are now ready to prove (e)=l>(a). 

[Proof of (e)=^(a)]. Assume that the proposition (e) holds for G, that is, G 
is internally triconnected and has no separation pair {m, u} such that both u 
and V are on the same path Pi, 1 < z < 3, where Pi connects vi and V 2 , P 2 
connects V 2 and V 3 , and P 3 connects ^3 and v\. We shall show that G has a 
canonical decomposition 77 = (Vi, V 2 , • • • , Vli). If the outer vertices v\ and V 2 are 
not consecutive on Fq{G), then add a virtual edge (wi,W 2 ) to the original graph 
and let G be the resulting graph. Take u\ = v\^U 2 = V 2 , and = W 3 . Take as Vi 
the set of all the vertices on the boundary of the inner face containing the edge 
(ui, U 2 ), and take Vh = {««}. (See Fig. 5.) Then Un = ^ Vi, otherwise, {ui, U 3 } 

would be a separation pair of G on P 3 , a contradiction. Hence Condition (cdl) 
of a canonical decomposition holds. Since Gh = G is internally triconnected. 
Condition (cd2) holds for k = h. Since = W 3 has degree three or more. 
Condition (cd3) holds for k = h. G \s internally triconnected, and the outer 
vertex of G is not contained in any separation pair of G since (^ 1 ,^ 2 ) is an 
edge of G and {W 3 , a;} is not a separation pair of G for any vertex x on path P 2 
or P 3 . Therefore Gh-i = G — is also internally triconnected, and hence (cd2) 
holds for 7 = /i — 1. If y = Vi{jVh, then simply setting h = 2 we can complete 
a proof. One may thus assume that V yf Vi IJ V7 and hence h > 3. We choose 
Vh-i,Vh- 2 i • • • , V 2 in this order and show that (cd2) and (cd3) hold. 

Assume as an inductive hypothesis that h> i+1 >3 and the sets Vh,Vh-i, 
• • • , have been appropriately chosen so that 

(1) (cd2) holds for each index k > i, and 

(2) (cd3) holds for each index 7 > z -|- 1. 

We then show that there is a set Vi of outer vertices of Gi such that 

(1) (cd2) holds for the index 7 = z — 1, and 

(2) (cd3) holds for the index k = i. 

Let wi,W 2 , - • • ,wt be the outer vertices of Gi appearing clockwise on Fq{G) 
in this order, where wi = vi and Wt = V 2 - There are the following two cases to 
consider. 

Case 1: Gi is triconnected. Since Gi is triconnected and at least one vertex in 
Vi+i has a neighbor in Gi, there is an outer vertex w ^ V\ oi Gi which has a 
neighbor in Gi. We choose the singleton set {zc} as Vi. Since Gi is triconnected 
and w is an outer vertex of Gi, Gi-i = Gi — w is internally triconnected and w 
has three or more neighbors in Gi_i. Thus (cd2) holds for 7 = z — 1, and (cd3) 
holds for 7 = z. 
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Case 2: Otherwise. Since i > 2, Gi is not a single cycle. Gi is internally tricon- 
nected, but is not triconnected. Therefore there is a chord-path for Fo{Gi). Let 
Q be a minimal chord-path, let Wp and Wq be the two ends of Q, and let p < q. 
Then q > p + 2; otherwise, Gi would not be internally triconnected. We now 
have the following two subcases. 

Subcase 2a: {wp+i,Wp+ 2 , ■ ■ ■ ,Wq-i} is an outer chain of Gi. In this case we 
choose {wp+i, Wp+2, • • • , Wg-i} as Vi. Since Vi is an outer chain and Q is a 
minimal chord-path, we have Vif]Vi = 0. Each of Wp+i and Wq-i has exactly 
one neighbor in Gi_i, and each vertex w £ Vi has a neighbor in Gi because w 
has degree three or more in G and has degree two in Gi. One can thus know 
that (cd3) holds for k = i. 

We next show that (cd2) holds for k = i—1. Assume for a contradiction that 
Gi-i is not internally triconnected. Then Gi-i has either a cut vertex u or a 
separation pair {m, w} having one of the three types illustrated in Fig. 4. 

Consider first the case where Gi-i has a cut vertex v, as illustrated in Fig. 8. 
Then v must be an outer vertex of Gi and v Wp,Wq] otherwise, Gi would not 
be internally triconnected. The minimal chord-path Q above must pass through 
the outer vertex v, contrary to Condition (iv) of the definition of a chord-path. 

Consider next the case where Gi-\ has a separation pair {m, w} having one 
of the three types illustrated in Fig. 4. Then {u, w} would be a separation pair 
of Gi having one of the three types illustrated in Fig. 4, and hence Gi would not 
be internally triconnected, a contradiction. 

Subcase 2b: Otherwise. In this case, every vertex in {wp+\,Wp+ 2 , ■ ■ ■ , has 

degree three or more in Gp, otherwise, Q would not be minimal. Furthermore, 
we can show that at least one vertex w in {wp+i,Wp+ 2 , • • • , li'g-i} has a neigh- 
bor in Gi, as follows. Suppose for a contradiction that none of the vertices in 
{wp+i,Wp+ 2 , ■ ■ ■ , Wq-i} has a neighbor in Gi. Then {wp, Wq} is a separation pair 
of G, and Wp,Wp+\, - ■ ■ ,Wq is a path on Fo{G), as illustrated in Fig. 9. Since 
i <h — 1, none of Wp, Wp+\, • ■ ■ ,Wq is W3 = Thus both Wp and Wq are either 
on path P 2 or on path P3, and hence Proposition (e) would not hold for G, a 
contradiction. 

We choose the singleton set {w} as Vi for the vertex w above. Then clearly 
hi n hi = 0, and (cd3) holds for the index k = i. Since w is not an end of a chord- 
path of Fo(Gi) and Gi is internally triconnected, Gi_i = Gi — w is internally 
triconnected and hence (cd2) holds for the index k = i — 1. □ 

4 Proof of (b)<^(f) 

In this section, we give a proof of (b)<J4>(f). 

Sketchy proof of (b)<t>(f). We can prove that if (Tr,Ti,,Tg) is a realizer of G 
then Tg is an orderly spanning tree of G rooted at rg. The detail is omitted in 
this extended abstract, due to the page limitation. 

From an orderly spanning tree T of G we can construct a realizer (T^, T{,, Tg) 
of G. We take Tg = T. For each vertex 1 < z < n, we appropriately choose 
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Fig. 8. Graph Gi and the outer chain Vi. Fig. 9. Graph Gi. 



an edge incident to Ui as an outgoing edge of Th (or Tr) according to some rules. 
The detail is omitted in this extended abstract, due to the page limitation. □ 
Since the proof of (e)=l>(a) is constructive, we can immediately have a linear 
algorithm to find a canonical decomposition of G if Proposition (e) holds for G. 
Moreover, we can examine whether Proposition (e) holds for a given graph G, by 
using the linear algorithm for decomposing a graph to triconnected components 
[14]. Furthermore, one can know from the proof of (b)^(f) that if G has a 
realizer then one can immediately find an orderly spanning tree of G. We thus 
have the following corollary from Theorem 1. 

Corollary 1. If a plane graph G is internally triconnected and has no separation 
pair such that both u and v are on the same path Pi., 1 < i < 3, then 

one can find in linear time a canonical decomposition, a realizer, a Schnyder 
labeling, an orderly spanning tree, and an outer triangular convex grid drawing 
of G having size (n — 1) x (n — 1). 
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Abstract. It is known that for every integer k > 4, each fc-map graph 
with n vertices has at most kn—2k edges. Previously, it was open whether 
this bound is tight or not. We show that this bound is tight for k — 4. We 
also show that this bound is not tight for large enough k; more precisely, 
we show that for every 0 < e < ^ and for every integer k > each 
fc-map graph with n vertices has at most (||| -|- t)kn — 2k edges. We 
further show that for every positive multiple k of 6, there are infinitely 
many integers n such that some fc-map graph with n vertices has at least 
i^k + i)n edges. 



1 Introduction 

Chen, Grigni, and Papadimitriou [2] studied a modified notion of planarity, in 
which two nations of a (political) map are considered adjacent when they share 
any point of their boundaries (not necessarily an edge, as planarity requires). 
Such adjacencies define a map graph (see [3] for a comprehensive survey of known 
results on map graphs). The map graph is called a k-map graph for some positive 
integer /c, if no more than k nations on the map meet at a point. As observed 
in [2], planar graphs are exactly 3-map graphs, and the adjacency graph of the 
United States is nonplanar but is a 4-map graph. 

The above definitions of map graphs and /c-map graphs may be not rigorous 
enough. For this reason, we give a different rigorous definition. Consider a bi- 
partite graph B = (V,P;Eb) with vertex sets V,P and edge set Eb CVxP. 
The half-square of B is the simple graph G with vertex set V and edge set 
E = {{ui,W2} I wi G U, U2 G V, and they have a common neighbor in B}. A 
graph is a map graph if it is the half square of a bipartite planar graph. For a 
positive integer k, a graph is a k-map graph if it is the half square of a bipartite 
planar graph B = (V,P;Eb) in which each p G P is adjacent to at most k 
vertices in V. 

Chen [1] proved that for every integer k > 4, each fc-map graph with n > k 
vertices has at most kn — 2k edges. It is natural to ask whether this bound is 
tight or not. Indeed, this question is one of the open questions asked in [3]. In this 

* The full version can be found at http://rnc.r. dendai. ac.jp/~chen/papers/kmap. pdf 
** Supported in part by the Grant-in-Aid for Scientific Research of the Ministry of 
Education of Japan, under Grant No. 14580390. 
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paper, we show that this bound is tight when fc = 4. Moreover, we laboriously 
show that for each large enough fc, this bound is not tight. More precisely, we 
show that for every 0 < e < ^ and for every integer fc > each fc-map graph 
with n > fc vertices has at most (||| + e)kn — 2k edges. This result has many 
important consequences. For example, it implies that the arboricity of a fc-map 
graph is at most (||| -I- e) • fc. Moreover, it implies a linear-time algorithm for 
coloring a given fc-map graph with at most 2 • ( ||| -|- e) • fc colors for every fc > . 

Previously, there was no polynomial-time algorithm for coloring a given fc-map 
graph with less than 2k colors, although Sanders and Zhao [4] showed that each 
fc-map graph can be colored with at most [|fc] colors. The proof by them does 
not lead to a polynomial-time algorithm for coloring a given fc-map graph G, 
because their proof requires that G be given together with its embedding in the 
plane (or sphere) while it is still unknown how to construct a plane (or sphere) 
embedding of a given fc-map graph in polynomial time. 

The above new upper bound (namely, (||| -I- e)kn — 2k) may look too loose 
at first glance. Indeed, since each fc-map graph can be colored with at most 
[|fc] colors [4], one may be tempted to show that each fc-map graph with n 
vertices has at most [|fc]n edges. A little unexpectedly, we can show that for 
every positive multiple fc of 6, there are infinitely many integers n such that 
some fc-map graph with n vertices has at least (^fc -I- |)n edges. 

2 Basic Definitions 

Throughout this paper, a graph may have multiple edges but no loops, while a 
simple graph has neither multiple edges nor loops. Our terminology is standard 
but we review the main concepts and notation. Let G be a graph. A bridge of G 
is an edge of G whose removal increases the number of connected components in 
G. G is bridgeless if it has no bridge. Let u be a vertex in G. The degree of v in G, 
denoted by da{v), is the number of edges incident to v. Note that dc{v) may be 
larger than the number of neighbors of u in G because of multiple edges. Vertex 
V is isolated in G if dciv) = 0. For U C V{G), the subgraph of G induced by U 
is the graph ([/, Eu) where Eu consists of all e G if such that both endpoints of 
e are in U. A cycle of G is a connected subgraph G of G such that each vertex 
of G is incident to exactly two edges of G. A path of G is a simple connected 
subgraph P of G such that P is not a cycle and each vertex u of P satisfies 
dp{v) < 2. The length of a cycle (respectively, path) is the number of edges on 
it. The internal vertices of a path P are those vertices v on P with dp{v) = 2. 

A graph is planar if it can be embedded in the plane (respectively, sphere) so 
that any pair of edges can only intersect at their endpoints; a plane (respectively, 
sphere) graph is a planar one together with such an embedding. Let ^ be a sphere 
graph. Consider the set of all points of the sphere that lie on no edge of Q. This 
set consists of a finite number of topologically connected regions; the closure 
of each such region is a face of Q. Let P be a face of G. We denote by V{P) 
(respectively, E{F)) the set of all vertices (respectively, edges) of G that are 
contained in F. The size of P is |V(P)|. Let Pi and P 2 be two faces of Q. Pi and 
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F 2 touch if V{Fi)r\V {F 2 ) is not empty. and F 2 strongly touchif E{Fi)r\E{F 2 ) 
is not empty. Obviously, if Fi and E 2 strongly touch, then they touch. However, 
the reverse is not necessarily true. When Ei and F 2 strongly touch in Q, merging 
Fx and F 2 is the operation of modifying Q by deleting all edges in E{Fi)fiE{F 2 ). 

Fix an integer fc > 3. A k-face sphere graph is a bridgeless sphere graph with 
no face of size larger than k. Let Ti. he a k-face sphere graph. Let F be a face of 
H. F is small if |H(F)| < [I]. F is large if it is not small. F is critical if it is 
small and strongly touches exactly two faces of . F is dangerous if it is critical 
and strongly touches a face of size less than k. We classify critical faces F into 
three types as follows: 

• Type 1: The boundary of F is a cycle. (Comment: The two faces strongly 
touching F may or may not strongly touch.) 

• Type 2: The boundary of F is formed by two vertex-disjoint cycles. 

• Type 3: The boundary of F is formed by two edge-disjoint cycles and the two 
cycles share exactly one vertex. 

The map graph of Ti. is the simple graph whose vertices are the vertices of 
Ti and whose edges are those {vi,V 2 } such that there is a face F in Ti with 
{vi,V2}CV{F). 

Lemma 1. Let k be an integer larger than 2, and let G = (V, E) he a connected 
k-map graph with at least k vertices. Then, there is a k-face sphere graph Ti = 
{V,Eu) whose map graph is a supergraph ofG. 

3 A New Upper Bound 

In this section, we show the following theorem: 

Theorem 1. Let 0 < e < and let k be an integer not smaller than 
Then, for every k-face sphere graph Ti with at least k vertices, the map graph of 
Ti has at most (||| -\- e)kn — 2k edges, where n is the number of vertices in Ti. 

We prove Theorem 1 by induction on n + f , where / is the number of faces 
in Ti. In the base case, we have n = k and / = I, and hence the map graph of 
Ti has exactly edges. 

For the induction step, suppose that n-\- f > k. The following lemma deals 
with two simple cases. 

Lemma 2. The map graph ofTi has at most (||| -|- e)kn — 2k edges if at least 
one of the following conditions holds: 

1. There are two strongly touching faces Fi and F 2 in Ti such that merging 
them does not creat a face of size larger than k. 

2. Ti has a small face F such that V{F) contains an isolated vertex v. 

By the above two simple cases, we may make the following assumption: 
Assumption 1 Neither condition in Lemma 2 holds. 
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By Assumption 1, no two small faces of Ti can strongly touch, and hence 
each critical face strongly touches two large faces in Ti. Moreover, each face of 
TL has size at least 3. 

Corollary 1 . No small face strongly touches exactly one face in TL. 

Lemma 3 . Suppose F is a dangerous face ofTL. Let Fi and F2 be the two faces 
strongly touching F in TL. Then, we can shrink F and enlarge either one or both 
of Fi and F2 without modifying the other faces (than F, F\, and F2) of TL to 
obtain a new k-face sphere graph TL' such that 

• TL' has n vertices and f faces, 

• both Fi and F2 have size k in TL' ( and so F is not a dangerous face of TL' ), 

• each dangerous face of TL' is also a dangerous face of TL, and 

• the map graph ofTL' has at least as many edges as the map graph ofTL. 

In Lemma 3, we can indeed prove that Assumption 1 remains to hold after 
replacing TL therein by TL' . But the proof is unnecessary here, because even 
when Assumption 1 does not hold after replacing TL therein by TL' , we can apply 
Lemma 2 to TL' . In more detail, since TL' has the same numbers of vertices and 
faces as TL does, we can modify the proof of Lemma 2 to prove that the map 
graph of TL' has at most (||| + e)kn — 2k edges. In turn, the map graph of TL 
has at most (||| + e)kn— 2k edges because the map graph of TL' has at least as 
many edges as the map graph of TL. 

So, we may assume that in Lemma 3, Assumption 1 remains to hold after 
replacing TL therein by TL' . Then, by repeatedly applying Lemma 3, we can make 
the following assumption: 

Assumption 2 TL has no dangerous face. 

Lemma 4 . Suppose that a critical face F of TL has a vertex Vi on its boundary 
such that Vi has degree more than 2 in TL and is on the boundary of exactly one 
(say, Fi) of the two faces F\ and F2 strongly touching F in TL. Then, we can 
modify F and F\ without modifying the other faces (than F and F\) to obtain a 
new k-face sphere graph TL' such that 

• TL' has n vertices and f faces, 

• Fi still has size k in TL' while the size of F in TL' is larger than the size of 
F in TL by 1, 

• F strongly touches F\, F2, and exactly one face F3 ^ {^1,^2} in TL' , 

• each critical face ofTL' other than F3 is also a critical face ofTL, 

• no face other than F^ is a dangerous face in TL' , and 

• the map graph ofTL' has at least as many edges as the map graph ofTL. 

In Lemma 4, T 3 may be a critical or even dangerous face of TL' . However, 
this does not cause us any problem. To see this, suppose that F 3 is a critical or 
dangerous face of TL' . Then, merging F 3 and F in TL' does not yield a face of 
size larger than k because the size of T 3 (respectively, F) in TL' is at most |"|] 
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(respectively, [|] + 1) and hence merging them in Ti! yields a face of size at most 
rfl + (r|l +1) — 2 < k. In turn, since Ti.' has the same numbers of vertices and 
faces as Ti does, we can modify the proof of Lemma 2 to prove that the map 
graph of Ti' has at most (||| + e)kn — 2k edges. Consequently, the map graph of 
Ti has at most (||| + e)kn — 2k edges because the map graph of Ti' has at least 
as many edges as the map graph of Ti. 

So, we may assume that in Lemma 4, does not become a critical or dan- 
gerous face of Ti' , and both Assumptions 1 and 2 remain to hold after replacing 
Ti therein by Ti' . Then, by repeatedly applying Lemma 4, we can make the 
following assumption: 

Assumption 3 For every critical face F ofTi, each vertex shared by the bound- 
aries of F and exactly one of the two faces strongly touching F in Ti has degree H 
in Ti. 

Let k 2 = [f] • Recall that Ti has no face of size at most 2. For each j with 
‘IT> If j <k, let fj be the number of faces of size j in Ti. For each j with 3 < j <k 2 , 
let fj^c be the number of critical faces of size j in Ti. Let no be the number of 
isolated vertices in Ti. 

Lemma 5. Y!j=zi.j ~ ‘^)fj < 2n - no - 4. 

Fix a large constant (3 independent of k. The choice of (3 will become clear 
later. 



3.1 The Case Where “ 2)/j,c > f 

Let Ti' be a copy of Ti. For each face F of Ti, let F' denote the face of Ti' 
corresponding to F. We modify Ti' by performing the following steps: 

1. For each Type-1 critical face F of Ti, if each of Pp and Qp has at least one 
internal vertex where Pp and Qp are the two edge-disjoint paths shared by 
F and the two faces strongly touching F in Ti, then triangulate F' in Ti' by 
adding |R(T’)| — 3 new edges {x,y} such that x is an internal vertex of Pp 
and y is an internal vertex of Qp; otherwise, triangulate F' in Ti' arbitrarily. 

2. For each Type-2 critical face F of Ti, triangulate F' in Ti' by adding |R(T’)| 
new edges {x,y} such that x is on Pp and y is on where Pp and Qp are 
the two vertex-disjoint cycles forming the boundary of F. 

3. For each Type-3 critical face F, triangulate F' in Ti' by adding |R(A)| — 2 
new edges {x,y} such that x is on Pp but not on Qf and y is on Qf but 
not on Pp where Pp and Qp are the two edge-disjoint cycles forming the 
boundary of F. 

4. For each noncritical face F, triangulate F' in Ti' by adding at least (|R(F) | — 
3) -|- ip new edges, where ip \s the number of isolated vertices in F. 

How many edges are in the map graph of Ti but are not in the map graph of 
Ti'T To answer this question, we first prove the following lemma: 
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Lemma 6. Let F he a face ofTi. Then, the following statements hold: 

1. If F is a Type-1 critical face ofTL, then after Step 1 above, the map graph 

ofTL has at most (|y(_F)| — 3) edges between vertices in V{F) that 

are not edges in the map graph ofTL' . 

2. If F is a Type- 2 critical face of TL, then after Step 2 above, the map graph 

ofTL has at most l^(^)l edges between vertices in V{F) that are not 

edges in the map graph ofTL'. 

3. If F is a Type-3 critical face ofTL, then after Step 3 above, the map graph 

ofTL has at most — — (|y(J^)| — 2) edges between vertices in V{F) that 

are not edges in the map graph ofTL'. 

4 . If F is a noncritical face ofTL, then after Step 4 above, the map graph of 

TL has at most between vertices in V (F) that are not 

edges in the map graph ofTL' . 

Since TL' becomes a triangulated sphere graph, its map graph has at most 

3n — 6 edges. Moreover, |h^(.P)| + 3 > l^(-F’)! if and only 

if |M(F)| < 6.5. Thus, by Lemma 6, we have the following corollary. 

Corollary 2. The number of edges in the map graph ofTL is at most 



3n — 6 + 



(j-2)(j-3) 



i=3 



ifj ~ /i,c) +y~!( 

3=3 






~j+3)/i,c+y^(^-j)/i,c- 

i=7 



Now, we are ready to show a key lemma. 

Lemma 7. Suppose that ~ Then, the map graph ofTL has 

at most (1 ~ ^ + 8ffc)^n — 2 k edges. 

Proof. Since k > ^ and e < we have k > 373 and fc2 > 186. In turn, 
for every j G {3,..., 6}, j + 3 < — 2). Moreover, for every 

j G {7, 8 , ... , ^2}, \ — j < — 2). Thus, by Corollary 2, the number of 

edges in the map graph of TL is at most 



3n - 6 + ^(j - 2){fj - /yc) + 

3=3 

„ „ ^ — 3,„ , n, k 2 — 2 n 

<3n-6+— (2„-4--) + — 

„ „ ^ — 3,„ . n, k — 3 n 

<3„-6+—(2„-4--) + — .- 



k2 



y^(j 2)/j,c 

3=3 



The first inequality above follows from Lemma 5, the assumption that ~ 

‘^)fj,c > and the observation that The second inequality above 

follows from the observation that > k 2 — 2 . □ 
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3.2 The Case Where - 2)/j,c < % 

Let Tic be a copy of Ti. We modify Tic by performing the following steps for each 
critical face F of Tic'- 

1. Find a face F\ strongly touching F such that the number hp of vertices 
shared by F and F\ only is not smaller than the number ip of vertices 
shared by F and F 2 only, where F 2 is the other face strongly touching F in 
He. 

(Comment: Merging F and Fi yields a face of size exactly |M(Fi)| + ip.) 

2. Merge F and Fi into a single face F3, and further delete exactly ip isolated 
vertices that were shared by F and F\ only before the merge. 

(Comment: We claim that the size of F3 is k after this step. To see this, first 
recall that before this step, the size of Fi is k by Assumption 2. Now, by the 
comment on Step 1, the claim holds. By the claim, executing the two steps 
does not creat dangerous faces.) 

By the above two steps, we have the following lemma immediately. 

Lemma 8. He has no critical face. Moreover, Assumption 1 remains to hold 
even after replacing H therein by He. 

The next lemma will be used to show that the number of edges in the map 
graph of He is not so smaller than the number of edges in the map graph of H. 

Lemma 9. The number of edges in the map graph ofH is larger than the number 
of edges in the map graph of He by at most ^k{n — nn), where n-p is the number 
of vertices in He • 

A crucial point is that the number of edges in the map graph of He is not so 
large. We will show this below. 

A trouble with He is that some large face in He may have very few edges on 
its boundary. So, instead of He, we will work on a new fc-face sphere graph TZ 
defined as follows. Initially, 7^ is a copy of He. We then modify this initial TZ by 
performing the following step for each large face A in it: 

— If F has at least [ isolated vertices, then use new edges to connect the 
isolated vertices in F into a cycle Cp, in such a way that F is split into two 
faces F' and F” one of which (say, F') has no vertex in its interior and has 
Cp as its boundary. 

(Comment: F” contains the same vertices as F did, and hence is large. 
Moreover, F” contains no isolated vertex. On the other hand, F' may be a 
small face, but its boundary must contain at least |" edges. Moreover, 
F' strongly touches F” only.) 

By the above comment, the map graph of He is the same as the map graph 
of TZ. So, it suffices to work on TZ instead of He. 

We classify the small faces of TZ into two types as follows. For each small face 
F of TZ, if F is also a small face of He, then we say that F is old; otherwise, we say 
that F is new. The following lemma is clear from Lemma 8 and the construction 
of F: 
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Lemma 10. The following statements hold: 

1. No two small faces ofTZ strongly touch in TZ. 

2. Each old small face ofTZ strongly touches at least three large faces ofTZ. 

3. Each new small face of TZ strongly touches exactly one large face of TZ, 

and its boundary contains at least edges. 

4- Each large face ofTZ has at most —1 isolated vertices. Consequently, 

the boundary of each large face contains at least (fe + 2) — [ edges. 

Note that the number nn in Lemma 9 is also the number of vertices in TZ. 
We need several more notations. Let 

• mn be the number of edges in TZ, 

• \j be the number of faces of size j in TZ for each j > 3, 

• Abig be the number of large faces in TZ, 

• Anew be the number of new small faces in TZ, 

• Aoid be the number of old small faces in TZ, 

• TZ* be the dual graph of TZ, 

• S* be the underlying simple graph of TZ* (i.e., S* is the simple graph obtained 
from G by deleting multiple edges), and 

• ms* be the number of edges in S*. 

Lemma 11. m-j^ ;^(9Abig “t~ Anew) and ms* C: 9Abig “t~ Anew- 

By Lemma 11, the average number of multiple edges between a pair of adja- 
cent vertices in TZ* is at least which is large if k is large. Moreover, if there 
are many multiple edges between a pair {Fi,F 2 } of adjacent vertices in TZ* , then 
the two faces Fi and F 2 of TZ must share many vertices. This is a key for us to 
show that the map graph of TZ does not have so many edges. We will clarify this 
below. 

Let TZ' be a copy of TZ. For each face F of TZ, let F' be the face of TZ' 
corresponding to F. We modify TZ' by triangulating each face of TZ' . We want 
to estimate how many edges are in the map graph of TZ but are not in the map 
graph of TZ' . 

Lemma 12. If we modify TZ by triangulating exactly one face F, then the num- 
ber of edges in the map graph of TZ decreases by at most ^ 

Consequently, the number of edges in the map graph ofTZ is at most Snu -6-1- 

(j-2)(j-3) , 

Z^j=3 2 N ■ 

The above estimate of the number of edges in the map graph of TZ is too 
pessimistic. The next key lemma gives a better estimate. 

Lemma 13. Let F\ and F 2 be two strongly touching faces ofTZ. Suppose that 
there are r \^2 multiple edges between Fi and F 2 in TZ* . Then, the following hold: 
1. There is a set Xi ^2 of at least ri,2 vertices shared by the boundaries of Fi 
and F 2 such that no two vertices of X \^2 ore shared by the boundaries of two 
faces F3 and F 4 ofTZ with {^1,^2} 7^ {Fs,F 4 }. 
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2. If we modify TZ by triangulating F\ and F 2 only, then the number of 

edges in the map graph of TZ decreases by at most J2i=i — 

r?,2“'^ri,2 + 12 

2 ■ 

3. The map graph ofF has at most Sn-jz — 6 + Ylj =3 ~ 

edges. 

Proof. We prove the three statements in turn as follows. 

(Statement 1) Consider the subgraph of F whose edges are the edges shared 
by the boundaries of F\ and F 2 and whose vertices are the endpoints of these 
edges. The subgraph is either a cycle of length ri _2 or a collection of vertex- 
disjoint paths each of which has length at least 1. In the former case, we set 
X 12 to be the set of vertices on the cycle. In the latter case, we traverse the 
vertex-disjoint paths clockwise; during the traversal of each path, we let X \^2 
include all vertices except the last one on the path. Obviously, in both cases, 
Xi ^2 is as required. 

(Statement 2) By Lemma 12, triangulating Fi (respectively, F 2 ) only causes 
the map graph of TZ to lose at most (I^(-P'i)I-2K|V"(Fi)|- 3) (j-ggpectiveiy^ 
(|V(F2)|-2K|V(F2)|-3) Pqj. j g {1,2}, call the quantity 

(T(G)|-2K|y(F.)|-3) pessimistic loss of Fi. 

Let /C be a k-face sphere graph obtained from TZ by triangulating Fi and 
F 2 only. Let Ei (respectively, E 2 ) be the set of edges in /C between vertices in 
V{Fi) (respectively, ^(^ 2 )). Let u and v be two vertices in Xi^ 2 - By the proof 
of Lemma 12, we have the following observations: 

• Suppose that Ei U E 2 contains an edge between u and v. Then, for each 
i G {1,2}, the edge {u,v} is not counted in the pessimistic loss of Tj. 

• Suppose that Ei U E 2 contains no edge between u and v. Then, for each 
i G {1,2}, the edge {u,v} is counted in the pessimistic loss of Tj. 

Also note that there are at most 3|Ai 2 | — 6 (unordered) pairs {m, w} C 
X 12 such that El U E 2 contains an edge between u and v. Thus, by the above 
observations, at least I^u 2 l(l^i, 2 |-i) _ ('3| 2 ] — 6) edges are counted in both the 

pessimistic loss of Fi and the pessimistic loss of F 2 . Hence, triangulating Fi and 
F 2 only causes the map graph of TZ to lose at most X)i=i ~ 

<2-7^1,2 + 12 

(Statement 3) Let 'P be the set of all (unordered) pairs {Fi,Fj} such that Tj 
and Fj are strongly touching faces of TZ. For each pair {Fi, Fj} G 'P, let be 
the number of edges shared by the boundaries of F) and Fj in TZ, and let Xij 
be a set of at least rij vertices shared by the boundaries of < and Fj such that 
no two vertices of Xij are shared by the boundaries of two faces and Fji of 
TZ with {Fi,Fj} yf {Fj/j/p/}. Xij exists because of Statement 1. 

By Statement 2, the pessimistic estimate (namely, il^(F)|-2X|y(F.)|-3) _j_ 
(A(F)l-2K|y(Fj)|-3) ^ number of edges in the map graph of TZ after 

r? • — 7r ■ - + 12 

triangulating Fi and Fj only, overcounts at least — — ^ edges. We associate 

these overcounted edges with Xij. 
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Let {Fi,Fj} and {Fi>,Fj'} be two distinct pairs in V. By our choices of Xij 
and Xi'j', the set of the overcounted edges associated with Xij does not intersect 
the set of the overcounted edges associated with Xi^ji. Thus, by Lemma 12, the 
number of edges in the map graph of TZ is at most in-ji — 6 + X)j =3 ~ 

— 7r- +12 

Y.{Fi,Fj}ev ~ — \'^\ = "^-S* and Y.{Fi,Fj}ev = '^T^- Thus, 
by simple calculus, the number of edges in the map graph of TZ is at most 

inn — 6 + Aj — 2 +r + ~ Qms*, and hence is at most 

inn — 6 + X)j =3 by Lemma 11. □ 

Corollary 3. The map graph ofF has at most {^k + ^)nn — 2fc edges. 

Lemma 14. Suppose that Then, the number of edges in 

the map graph ofTL is less than (|| + + -^^^^ip-)kn — 2k. 

Proof. When constructing F from Ti, we deleted at most vertices from 

each critical face F of H. Thus, n — nn < L > where the summation 

is taken over all critical faces F of H. In turn, since +(1^)1 > 3, n — nn < 

- 2) = E -laO' - 2)A,,c. So, nn > ^n. 

Now, by Lemma 9 and Corollary 3, the number of edges in the map graph 
of H is at most |/c(n — nn) + (|§fc + ^)nn — 2k= ^kn — {^k — ^)nn — 2k< 
(m T W ~ 2^1 because k > 373. □ 

Setting /3 = 41 and combining Lemmas 7 and 14, we finally have Theorem 1. 
By Theorem 1 and Lemma 1, we have the following corollary: 

Corollary 4. Let 0 < e < and let k be an integer not smaller than 
Then, every k-map graph G with at least k vertices has at most (||| + e)kn — 2k 
edges, where n is the number of vertices in G. 

4 Lower Bounds 

Theorem 2. For every multiple n of 4 with n>8, there is a 4-wiap graph with 
n vertices and 4n — 8 edges. 

Theorem 3. For every multiple k of 6, there are infinitely many integers n such 
that some k-map graph with n vertices has at least (^fc+ ^)n edges. 

References 

1. Z.-Z. Chen. Approximation algorithms for independent sets in map graphs. J. 
Algorithms, 41:20-40, 2001. 

2. Z.-Z. Chen, M. Grigni, and C.H. Papadimitriou. Planar map graphs. Proceedings 
of the 30th ACM Symposium on Theory of Computing, pp. 514-523, 1998. 

3. Z.-Z. Chen, M. Grigni, and C.H. Papadimitriou. Map graphs. J. ACM, 49:127-138, 
2002. 

4. D. P. Sanders and Y. Zhao. A new bound on the cyclic chromatic number. J. 
Combinatorial Theory Ser. B, 83:102-111, 2001. 




Dynamic Storage Allocation 
and On-Line Colouring Interval Graphs 



N.S. Narayanaswamy* 

Department of Computer Science and Engineering 
IIT-Madras, Chennai-600 036, India 



Abstract. We present an improved on-line algorithm for colouring in- 
terval graphs with bandwidth. This problem has recently been studied 
in [1] and a 195-competitive online strategy has been presented. We im- 
prove this by presenting a 10-competitive strategy. To achieve this result, 
we use the online colouring algorithm presented in [8, 9]. We also present 
a new analysis of a polynomial time 3-approximation algorithm for Dy- 
namic Storage Allocation(DSA) using features of the optimal on-line 
algorithm for colouring interval graphs [8,9]. 



1 Introduction 

Interval graphs are of interest in many fields as they are used to model the 
structure of many problems. One classical well studied problem is the issue of 
vertex colouring of interval graphs. Features of containment associated with in- 
tervals are well exploited to design efficient algorithms and fast implementations 
to optimally colour interval graphs (see [7]). Interval graphs are used to model 
many resource allocation problems. Each interval corresponds to a request for 
the usage of a resource exclusively for a certain period of time. In this paper we 
consider the issue of online colouring a set of intervals based on some relaxed 
properties (two adjacent vertices may get the same colour without violating an 
additionally specified constraint). Online algorithms are motivated by environ- 
ments where the inputs occur in a sequence and need to be serviced without any 
knowledge of the future inputs. Online interval colouring algorithms are one of 
many such algorithms that are of much interest (see book [2]). Our results use 
an optimal on-line algorithm for colouring interval graphs presented in [8,9]. 

The problem of colouring interval graphs with bandwidths is a generalization 
of the interval graph colouring problem. In the generalization, each interval has 
a weight in (0, 1]. These weights are referred to as bandwidths. A valid colouring 
of the vertices is one in which, for every r on the real line, the total bandwidth 
of the monochromatic intervals containing r is at most 1. Clearly, when each 
interval has bandwidth 1, we get the interval graph colouring problem. This 
problem has been of recent interest in [1] and, as has been remarked there, is a 
simultaneous generalization of online interval colouring and online bin- packing. 
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We refer to a colouring satisfying the above condition as a bandwidth satisfying 
colouring. By a c-bandwidth satisfying colouring, we refer to a colouring which 
satisfies the property that the weight of a maximum clique is upper bounded 
by c in a graph induced by vertices assigned the same colour. A colouring that 
assigns different colours to the end points of each edge is simply referred to as a 
valid colouring. 

An instance of DSA is / = {(di, Si, ri), . . . , (d„, s„, r„)} where the f-th inter- 
val, denoted by A, is [si,ri\. di is a non-negative integer that can be visualized 
as the number of memory locations requested by a process which is alive during 
the time interval A. We use the word demand to refer to the memory requests. A 
solution to this problem instance is a vector of non-negative integers (6i, . . . , 6„) 
such that, for 1 < i, j < n, the intervals [6^,6^ -I- di] and -I- dj] are non 

intersecting whenever the intervals li and Ij have non-empty intersection. We 
call such an assignment (6i, . . . , &„) valid assignment and max {bi + di) is called 

i 

the height of (6i, . . . , &„). The optimization problem is to find a minimum height 
valid assignment. The letters b,d,s, and r are chosen as they are suggestive of 
Base memory address. Demand, Starting time, and Release time, respectively. 
An instance of DSA is said to be aligned if all demands are powers of 2. An aligned 
assignment for an aligned instance of DSA is a valid assignment (6i, . . . , &„) in 
which each bi is an integer and di divides bi. Finally, DSA is a well motivated 
classical problem in computer science [10] studied in the settings of compile time 
memory optimization by Gergov [6], and bandwidth allocation by Leonardi et 
al. [11]. 

The representation of the input instance used in the DSA can be used to rep- 
resent an instance for the bandwidth satisfying colouring problem by modifying 
di to be a number between (0,1]. Geometrically, (di,Si,ri) represents an axis 
parallel rectangle of height di and width — s^. The input I can be visualized 
as an arrangement of rectangles, axis parallel unless mentioned otherwise, on 
the a;-axis (referred to as the time axis). The required solution for the colouring 
problem is to assign to each interval a colour such that in the graph induced 
by vertices assigned the same colour, the maximum clique weight is at most 1. 
Geometrically, we can view each colour as corresponding to a unit height strip 
parallel to the time axis on the plane. Indeed, the strips corresponding to dif- 
ferent colours are to be disjoint. The target is to use as few colours as possible. 
For DSA, the required solution is an arrangement in the plane such that rectan- 
gles that intersect along the time axis are disjoint and the position of rectangle 
(dijSijVi) is obtained by translation along the y-axis (referred to as the space 
axis) . The goal is to find a minimum height rectangle that contains this arrange- 
ment. In either case, we are required to translate the rectangles along the space 
axis to satisfy a related condition. 

Previous Work: Online colouring algorithms for interval graphs have been well 
studied. The best known algorithm is in [8, 9] which uses at most 3uj — 2 colours. 
It is also shown to be an optimal online algorithm in [9]. Other approaches 
toward online colouring is to use the first fit algorithm: allot the smallest valid 
colour to each vertex. Online colourings with the bandwidth constraint have 
been studied recently [1]. Their work uses a combination of the optimal online 




Dynamic Storage Allocation and On-Line Colouring Interval Graphs 331 



algorithm and the first fit approach. This algorithm achieves a constant, at most 
195, competitive ratio. 

For DSA, it is well known that finding the value of the optimum solution is 
NP-hard [4]. When the demands are all the same, the problem is the same as 
interval graph colouring which is well known to be polynomial time solvable [7]. 
On the other hand, the NP-hardness holds when each demand is an element of 
a set of two distinct integers. The best known approximation algorithms are a 
factor I algorithm for aligned DSA and a factor 3 algorithm for DSA, both due 
to Gergov [5,6]. 

Our Work and Results: We design a new online algorithm to colour interval 
graphs with the bandwidth constraint. Our approach is similar to that in [1] 
where the requests are classified into two subproblems based on the bandwidth. 
One subproblem is coloured using the first fit, and the other using the optimal 
on-line colouring algorithm. Each subproblem uses a distinct set of colours. In 
our approach, we classify the input into 3 classes, again based on bandwidth. We 
then appropriately apply the optimal on-line algorithm to each class, operating 
a different set of colours per class. The on-line algorithm is modified to suit 
the needs of each class, and this modification is what leads to an improved 
performance. The algorithm which is shown to be at most 10-competitive, a 
significant improvement. 

Our result on the DSA stems from attempts to analyze the algorithm pre- 
sented in Gergov’s short paper [6]. Prior to this 3 approximation algorithm 
for DSA, the best known results were 6 approximation algorithms designed by 
Kierstead [8] and Slusarek [12]. The approach taken by Kierstead was to first 
convert an instance of DSA to an instance of aligned DSA by rounding each 
demand to the nearest larger or equal power of 2. Then the instance of aligned 
DSA was approximated within a factor of 3 using features of an on-line colouring 
algorithm. The two steps put together yield the claimed 6 approximation algo- 
rithm. In this work we use Kierstead’s approach in an appropriate way to yield 
a 3-approximation algorithm for DSA. This algorithm, except for some minor 
differences, is same as the algorithm in [6]. Gonsequently, our contribution here 
is to present Gergov’s work in the setting of the previous efforts at DSA, and an 
elegant proof along the same lines as Kierstead’s analysis [8] . 

Plan: In Section 2 we present the required preliminaries and outline Kierstead’s 
on-line colouring algorithm. The online algorithm for colouring with bandwidth 
constraint is present in Section 3. Finally, the approximation algorithm for DSA 
and its analysis is presented in Section 4. 



2 Preliminaries 

Without loss of generality, an interval is a set of consecutive non-negative in- 
tegers. For a set of intervals P = {li '■ 1 < i < n}, we can associate a graph 
denoted by G{P). In this graph, there are as many vertices as the number of 
intervals in P. Each interval is associated with a unique vertex and vice versa. 
Two vertices are adjacent if there is a non-empty intersection between the cor- 
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responding intervals. Similarly, for a set of rectangles we associate a rectangle 
graph. 

The collection of intervals we are interested in are those that occur in an in- 
stance of the DSA. n denotes the number of intervals in an instance of DSA. For 
an instance / = {{di, Si, r,) : 1 < z < n}, G{I) denotes the vertex weighted inter- 
val graph as defined previously. Clearly, a vertex weighted interval graph with 
non-negative integer weights yields an instance of the DSA. Also, the weighted 
intervals in / and weighted vertices in interval graphs correspond to rectangles, as 
mentioned in the introduction. Consequently, we use these notions interchange- 
ably for ease of presentation. 

For an undirected graph G, oj{G) denotes the size of the maximum cardinality 
clique in G. A{G) denotes the max{ degree of zt : zz G V{G)}. For a weighted 
graph G, 0 J*{G) denotes the largest weighted clique in G. X*{G) denotes the 
height of a valid assignment of minimum height. It is mentioned in Kierstead [8] 
that there is a weighted interval graph G for which x*(G) > |w*(G). Note that 
in an unweighted interval graph (all weights equal to 1), x(G) = w(G). 

Let P be a collection of intervals. We now define three notions of density with 
respect to the collection P. For a positive integer r, the density of r is defined to 
be D{r) = |{/ G P : r G /}|. The density of an interval / is defined as D{I) = 
min{D(r) : r G /}. The density of P, D{P), is defined as max{D{I) : I G P}. 
We present the following crucial lemma from [8] for the sake of clarity. 

Lemma 1. Let P he a collection of intervals and G = G{P). If D{P) = 1 then 
w(G) < 2 and A{G) < 2. 

Proof. Since D{P) = 1, every interval I has a point r such that the point is 
present exclusively in I. That is, D{I) = 1 for every I G P. Therefore, if u;(G) > 3 
then there exist 3 interval 1 1,12,1s corresponding to vertices in G which form a 
triangle (a 3-clique). It follows that either /i C /2 U Is or I 2 U Is Q Ii- In either 
case we have identified an interval in P each of whose points is contained in 
another interval of P. This means there is an interval I G P such that D{I) > 2. 
This contradicts the hypothesis. Therefore, u;(G) < 2. Similarly, a vertex of 
degree at least 3 implies that there is an interval which is contained in the union 
of at most two other intervals. This would violate the hypothesis that D{P) = 1. 
Therefore, A{G) < 2. 



2.1 On-Line Colouring Interval Graphs 

It is well known that an interval graph can be coloured optimally with as many 
colours as the size of its largest clique. The algorithm first orders the vertices 
according to the leftmost point of the intervals associated with them. That is, 
the first vertex in the order is one whose leftmost point is the smallest . Then the 
vertices are considered according to the constructed order and greedily coloured. 
The colouring problem for interval graph becomes more challenging when we 
consider the problem of designing on-line algorithms. Here, along with the input 
interval graph, an order a = Vi, . . . ,Vn is presented. The algorithm must colour 
the vertices according to cr and use as few colours as possible. Below we present 
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the on-line colouring algorithm due to Kierstead for interval graphs which uses 
at most 3w — 2 colours. 

Let (j = v\, . . . ,Vnhe the ordering of the vertices of G. The vertices of G are 
rectangles of unit height and length same as that of the interval associated with 
them. The algorithm can be visualized as running in two phases: In the first phase 
an arrangement of the rectangles satisfying certain properties is computed. In the 
second a rectangle graph of the arrangement is 3-coloured to obtain a colouring 
of the interval graph. 

Phase I - Rectangle arrangement: Let Vi be the current vertex. Here we iden- 
tify a position, p{vi), for Vi depending on p{vi), . . . ,p{vi-i). First, for a non- 
negative integer r, let Gr(vi) be the induced subgraph of G on {vj G V{G) : j < 
i,p{vj) < r,{vi,Vj} G E{G)}. p{vi) is the smallest non-negative integer r such 
that uj{Gr{vi)) < r. It is clear that for r' < p{vi), uj{Gr'{vi)) > r' . Pictorially, 
the rectangle corresponding to Vi sits on the line p{vi). 

Properties of the Arrangement: 

1. For each rectangle v, p(y) < w — 1. 

2. Rectangles sitting on different lines do not intersect. This is true because the 
rectangles all have the same height. 

3. For a number i, consider the collection P of intervals corresponding to the 
vertices of the induced graph on {v\p{v) = i\. This collection has density equal 
to 1. From lemma 1 it follows that maximum vertex degree in this graph is at 
most 2 . 

Phase-II: Colouring the intersection graph: A greedy colouring according to a 
yields a 3-colouring. The reason is that the degree of each vertex in the rectangle 
graph is at most 2 . 

p{vi) depends only on the vertices which were considered prior to Vi and 
the colour assigned to Vi depends on the colour of its neighbours on the line 
p{vi). Further, vertices with p(v) = 0, all get the same colour, as they form an 
independent set. Consequently, it follows that the two phases can be performed 
on-line and at most 3u; — 2 colours are used. 

3 Online Colonring Intervals with Bandwidth 

Recall that the weight of each interval is a number in (0, 1]. The goal is to use 
the minimum number of colours to colour the vertices of an interval graph such 
that, for each colour c, the weight of the maximum weight clique is bounded by 1 
in the graph induced by the vertices assigned c. The algorithm should be an on- 
line strategy, in the sense that at each time step (decided by the algorithm), an 
input pair consisting of an interval and its bandwidth requirement is presented 
to the algorithm. This request should be serviced by assigning a colour before 
the next request can be presented. 

3.1 On-Line Strategy 

The colours to be assigned are split into three classes Ci, C 2 , and C 3 . Let 
a = V 1 V 2 be the sequence of intervals. Let b{vi) denote the associated 
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bandwidth, 0 < b{vi) < 1. When an interval arrives it is processed, according to 
its bandwidth, as follows: 

If b{vi) > is assigned a colour from the class Ci. Any valid bandwidth 

satisfying colouring of the vertices is a valid colouring of the set of intervals. The 
reason being that each bandwidth is more than i. Applying Kierstead’s on-line 
algorithm to colour Vi guarantees that the number of colours used is at most 3 
times the optimum number of colours. 

If j < b(vi) < In this case Vi is coloured using colours from € 2 - The 
colouring strategy described below uses as many colours as the size of the largest 
clique formed by intervals whose bandwidth is in 

Colouring Strategy: Ignore the bandwidth requirements, that is treat all these 
bandwidths as 1, and apply only the rectangle arrangement phase of the online 
colouring algorithm, uj colours are used from C 2 to colour the vertices based on 
this arrangement: for a vertex v with p{v) = y, the y-th colour is assigned. As the 
bandwidths are at most j, we have obtained a bandwidth satisfying colouring. 
Recall that oj denotes the size of the largest clique in the graph induced by the 
intervals in question. 

Lemma 2. The above strategy outputs a bandwidth satisfying eolouring using at 
most to colours. 

Proof As argued in section 2.1, for each y G {0, 1, . . . , w — 1}, the interval graph 
formed by intervals in the set {f|p(w) = y} does not have a clique with more 
than 2 vertices. Further, for each v, p{v) < u) — 1. Clearly, at most oj colours are 
used, and the colouring is bandwidth satisfying as each bandwidth requirement 
is at most Hence the lemma is proved. 

To compare with the optimum bandwidth satisfying assignment, we lower bound 
the number of colours in an optimum bandwidth satisfying colouring in terms 
of the number of colours in an optimum valid colouring. 

Lemma 3. The number of colours used by an optimum bandwidth satisfying 
colouring is at least one-third the number of colours used in an optimum valid 
colouring which is at least uj. 

Proof. In any optimum bandwidth satisfying solution, the largest clique size 
among monochromatic vertices is 3. If it exceeds 3, then such a solution wouldn’t 
be feasible, as the clique weight would be more than 1 due to the bandwidths 
being more than i. To obtain a valid colouring, we optimally colour each colour 
class using at most 3 colours. The number of colours used is at most 3 times the 
optimum bandwidth satisfying colouring. Therefore, the optimum valid colouring 
is at most 3 times the optimum bandwidth satisfying colouring. In other words, 
an optimum bandwidth satisfying colouring uses at least one-third the number 
of colours used by an optimum valid colouring. 

Number of Colours Used: From the previous two lemmas it follows that the 
on-line algorithm uses at most thrice the number of colours used by an optimum 
bandwidth satisfying colouring. The above result is a specific case of a more 
general algorithm called STEADY in [3] . 
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If b(vi) < i; These requests are assigned colours from the set C3. Here we 
modify the on-line colouring algorithm due to Kierstead presented in section 2.1. 

Phase I - Rectangle arrangement: Let v\, . . . ,Vi-\ denote the vertices corre- 
sponding to the intervals preceding Vi - Here we identify a position p{vi), for Vi, de- 
pending on p{vi), . . . ,p{vi-i). First, for a non-negative real number r, let Gr(vi) 
be the induced subgraph of G on {vj G V (G) : j < i,p{vj) < r, {vi, Vj} G E{G)}. 
p{vi) is the smallest real number r such that uj*{Gr{vi)) < r. It is clear that for 
r' < p{vi), Cij*{Gr>{vi)) > r' . Again, pictorially, the rectangle corresponding to 
Vi sits on the line p{vi). 

Lemma 4. For a number y, consider the collection Py of intervals corresponding 
to the vertices of {v\y G [p(u),p(u)-|-6(u))}. For all y, D{Py) = 1. In other words, 
the collection Py has density equal to 1. 

Proof. Let us assume that the statement is false. Consider the smallest y such 
that D{Py) > 1. Let Vi G Py be such that for each x G I{vi), D{x) > 2. Let 
r denote p{vi). Clearly, Lv*{Gr{vi)) < r, and for all r' < r, uj*{Gr'{vi)) > r' . 
Consequently, there is a clique K in Gr{vi) whose weight is r. In particular, 
there is a point z G I{vi) which is common to the intervals corresponding to the 
vertices in K . Further, D{z) = 1; only liyi) G Pr contains z, otherwise weight of 
K would be greater than r, and ui*{Gr{vi)) > r, which is a contradiction to the 
fact that p{vi) = r. On the other hand, D{z) > 2 in Py. Let y' be the smallest 
number in [p{vi),y] for which D{z) = 2 in Pyi. Clearly, there is & Vj,j > i 
such that p{vj) = y' and z G I{vj). Therefore, K U Vi G Gyi{vj). Consequently, 
^*{Gy'{vj)) > r -I- b{vi) = p{vi) + b{vi) > y > y' . This contradicts the rule for 
the choice ofp{vj), and this has arisen because of our assumption that there is a 
y for which D{Py) > 1. The assumption is false, and hence the lemma is proved. 

Lemma 5. The number of colours required by a bandwidth satisfying assignment 
is lower bounded by the ceiling of the height, H , of the arrangement output by 
phase-I. 

Proof. It is clear that there is a clique K of weight H. To see this consider a vt 
such that p{vi) + b{vi) = H. Let r = p{vi). By the argument used in lemma 4, 
there is a clique K in Gr{vi) whose weight is r = p{vi). K \Jvt is also a clique, and 
has weight r -|- b{vi) = p{vi) + b{vi) = iL. In a bandwidth satisfying assignment, 
the set of vertices of K that get the same colour form a clique of weight at most 
1. Consequently, the number of distinct colours used to colour the vertices of 
K must be at least as many as the weight of the clique, which is the height of 
the arrangement. Therefore, the number of distinct colours used is at least the 
ceiling of H. Hence the lemma. 

Phase-II: Colouring the intersection graph: The colouring approach is as follows: 
if I < p{vi) < then Vi is allocated the colour Cj G C3. 

Analysis: Clearly, the above scheme is an on-line strategy, as it processes a re- 
quest fully before considering the next. Our interest is in the number of colours 
that are used by the strategy, and the correctness of the result. The number of 
colours used is at most 4 times the optimum. The reason is that we split the 
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rectangle arrangement output by phase-I into blocks of height ^ each. The num- 
ber of such blocks is 4 times the height of the arrangement, and each block gets 
a separate colour. Consequently, the number of colours used is upper bounded 
by 4 times the ceiling of the height. The ceiling of the height of the arrangement 
is a lower bound on the number of colours used by any bandwidth satisfying 
colouring. Therefore, we use at most 4 times the optimum number of colours. 
Correctness. We now show that in the graph induced by the intervals that 
have been assigned the same colour, the weight of any clique is at most 1. Let us 
assume the contrary, that there is a clique of height greater than 1 among vertices 
assigned the same colour, say Cj. Since this clique is formed by intersecting 
intervals, there is one point common to all these intervals. Let this point be t. In 
the arrangement output by the above procedure, there is a point (t, u), | < u < 
which is occupied by at least 3 rectangles. Otherwise, using the fact that all 
the band widths are at most j, it follows that sum of band widths at point t is at 
most 1. Let us consider the collection P„. Recall that is the set of intervals 
corresponding to the set {u|u G [p{v),p{v) + 6 (u))}. Let h.h, and I 3 be three 
intervals in that contain the point t. A simple property of a set of intervals 
is that if three intervals share a point, then one of the intervals in contained 
in the union of the other two. Let us assume without loss of generality, that 
/i C /2 U I 3 . Clearly, D{Ii) > 2, and D{Pu) > 1. This conclusion contradicts 
lemma 4. Our assumption is wrong and therefore, it follows that the weight of 
any clique formed by vertices assigned the same colour Cj is at most 1 . 
Completing the Analysis: We have split the input sequence into three cases. 
We have also shown that Kierstead’s on-line colouring algorithm can be modified 
to perform competitively on each case. If the optimum for the 3 cases where 
0 \, 02 , and O 3 respectively, then the optimum number of colours used by a 
bandwidth satisfying colouring is lower bounded by each of the 3 optima. Using 
this fact, it follows that the on-line algorithm outlined above uses at most 10 
times the optimum number of colours used by a bandwidth satisfying assignment. 

4 Dynamic Storage Allocation 

Consider an instance I = {vi = {di, Si,ri)\l < i < n} of DSA. The interval 
graph colouring problem is a special case of the DSA in which all the demands 
are the same. Our idea is to consider the vertices in a certain order such that the 
rectangle graph obtained from an appropriate modification, to take into account 
non-trivial vertex weights, of Phase-I of the above online colouring algorithm is 
3-colourable. In the case when all demands are same, this property is guaranteed 
for any order in which we consider the vertices. The algorithm is as follows: 
Initially p{vi) is undefined for all Vi G I and base is a variable initialized to 0. 

1 . While there are unassigned rectangles, do the following steps 2 to 4. Let A 
be current set of vertices for which p{vi) is not undefined. 

2 . For a non-negative integer r, let Gr{vi) be the vertex weighted graph induced 

by {m|u G A,{vi,u\ G E{G),p{u) < r}. A vertex u G Gr{vi) has weight = 

r -I- 1 — p{u) if r -I- 1 — p{u) is at most the demand made by u. Otherwise, is 

the demand made by u. a{vi) is the smallest r > base such that uj*{Gr{vi)) < r. 
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3. Consider a vertex v such a{v) is the least among all vertices not in A. 

4. Add w to A and p{v) is set to a{v). base := p{v). 

5. If there are some rectangles outside A, go back to step 2. 

We now prove a lemma that is used to conclude that the rectangle graph of 
the above arrangement is indeed 3-colourable. Below, we use A to denote the 
arrangement output by the above algorithm for an instance I. We use R(v) 
to denote the rectangular region in A associated with v. For Vi, R(vi) is the 
rectangle whose diagonal is the line joining the points (si,p{vi)) and (ri,p{vi) + 
di). Geometrically, in A, the rectangle Vi sits on the line y = p{vi). Vi is said to 
be assigned to line p{vi). 

Lemma 6. For an integer r, the graph induced by {u|p(u) < r and R{v) n 
{{x^y)\x G I{v),y = r + 1} ^ (f\ has maximum vertex degree upper bounded 
by 2. 

Proof. For our proof, we view a vertex of demand d as d rectangles of unit 
demand. For notation, the rectangles associated with Vi are Vi^i, . . . , Vi^di and for 
each j, 1 < j < di, the interval associated with Vij is [si,rj]. Recall that [si,ri] 
is the interval associated with Vi. Let FI denote this new interval graph. Also, 
it is clear that A yields a rectangle arrangement for FI. In this arrangement, 
= P('^i.i) + j ~ 1 = p(ui) + J — 1, 1 < * < n, 1 < j < dj. For an integer 
t, let F[t{vij) be the induced subgraph of Ff on the vertex set {u G Ff : p(u) < 
t, {u, Vij} G E{F[)}. Now, it is easy to observe the following for each i,j,l<i< 
n,l < j < di, hy induction on j: 

o If t < p{vij), u>{Ht{vij)) > t. 
o If t = p{v^j),uj{Ht{v^j)) < t. 

Now we show that for each t, the density of the set of intervals {u : u G H,p{u) = 
t\ is equal to 1. From lemma 1 it follows that the maximum vertex degree in the 
corresponding interval graph is at most 2. This conclusion immediately implies 
this lemma. We now prove this claim about the density by contradiction. Let 
us assume that for some t, the density of the set A = {m : m G Fl,p{u) = t} 
is more than 1. Let mi G A be an interval whose density is more than 1. Since 
p{u) = t, by definition, uj{F[t-i{ui)) > t. Let AT be a clique of this size from 
Flt-i{ui). Let r be a point which belongs to the interval u\ and the intervals 
that form K . Since ui has density more than 1, r is in another interval rt 2 G A. 
Therefore, U\,U 2 and K form a t -I- 2 clique. Consequently, if u\ was assigned to 
A before U 2 , then Flt{u 2 ) contains the clique K U ui. This is a contradiction to 
the fact that p{u 2 ) = t, since uj{Ht{u 2 )) > t. We arrive at a similar contradiction 
if we consider the case where U 2 was assigned to A before ui, then we have a 
contradiction to the fact that p{ui) = t. Therefore, our assumption is wrong. 
The density of A is 1. Hence the proof of our claim and consequently, the lemma. 

Height of the Arrangement: This height is bounded by the size of a max- 
imum weighted clique in the weighted interval graph associated with /. Let B 
be the height of the arrangement. Let Vi be a rectangle which has a non-empty 
intersection with the line {{x,y)\y = B}. By definition of p{vi), it follows that 
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there is a clique of weight B — di in the neighbourhood of Vi . Consequently, there 
is a clique of weight B in G{I). Therefore, B is at most the size of a maximum 
weighted clique. 

Storage Allocation from A: The solution for / is obtained by greedily colour- 
ing the rectangle intersection graph obtained from A. The vertices are considered 
for colouring in the in the order in which they were entered into A. The fact that 
greedy colouring uses at most 3 colours follows from 6. This yields a solution 
whose height is at most 3 times the size of a maximum weighted clique. Hence 
we have a 3-approximation algorithm for DSA. 



Acknowledgments 

Thanks to Rohit Khandekar, Sunil Chandran, Thomas Erlebach, and Udo 
Adamy. Thanks to Jan Johannsen for his encouragement and support. 



References 

1. Udo Adamy and Thomas Erlebach. Online coloring of intervals with bandwidth. 
In Proceedings of the First Workshop on Approximation and Online Algorithms, 
2003. 

2. A. Borodin and Ran El-Yaniv. Online Computation and Competitive Analysis. 
Cambridge University Press, 1998. 

3. A. Feldmann, B. Maggs, J. Sgall, D.D. Sleator, and A. Tomkins. Competitive 
analysis of call admission algorithms that allow delay. Technical Report CMU-CS- 
95-102, Carnegie Mellon University, 1995. 

4. M.R. Carey and D.S. Johnson. Computers and Intractability - A Guide to the 
Theory of NP-Completeness. Freeman, 1979. 

5. J. Gergov. Approximation algorithms for dynamic storage allocation. In Proe. of 
European Symposium on Algorithms, volume 1136 of LNCS, pages 52-61. Springer, 
1996. 

6. J. Gergov. Algorithms for compile-time memory optimization. In Proe. of 10th 
ACM-SIAM Symposium on Discrete Algorithms, pages 907-908, 1999. 

7. M.C. Golumbic. Algorithmic Graph Theory and Perfect Graphs. Academic Press, 
1980. 

8. H.A. Kierstead. A polynomial approximation algorithm for dynamic storage allo- 
cation. Discrete Mathematics, 88, pages 231-237, 1991. 

9. H.A. Kierstead and W.T. Trotter. An extremal problem in recursive combinatorics. 
In Congressus Numeratium, volume 33, pages 143-153, 1981. 

10. D.E. Knuth. Art of Computer Programming, Vol.l: Fundamental Algorithms. 
Addison- Wesley, 1973. 

11. S. Leonard!, A. Marchetti-Spaccamela, and A. Vitaletti. Approximation algorithms 
for bandwidth and storage allocation problems under real time constraints. In Pro- 
ceedings of the 20th Foundations of Software Technology and Theoretical Computer 
Seience, volume 1974 of LNCS, pages 409-420. Springer, 2000. 

12. M. Slusarek. A colouring algorithm for interval graphs. In Proe. of Ifth Math- 
ematical Foundations of Computer Science, volume 379 of LNCS, pages 471-480. 
Springer, 1989. 




New Approximation Algorithms 
for Some Dynamic Storage Allocation Problems* 



Shuai Cheng Li^, Hon Wai Leong^, and Steven K. Quek^ 

^ School of Computing 
National University of Singapore 
3 Science Drive 2, Singapore 117543 
{lisc ,leonghw}@comp .nus . edu. sg 
^ School of Information Technology 
Nanyang Polytechnic, Singapore 
steven_quek@nyp . edu . sg 



Abstract. The offline dynamic storage allocation (DSA) problem has 
recently received some renewed attention and several new results have 
been reported. The problem is NP-complete and the best known re- 
sult for the offline DSA is a polynomial time 3-approximation algorithm 
[Gerg99]. Better ratios have been reported for special cases if restrictions 
are placed on the allowable sizes of the blocks [Gerg96,MuBh99]. In this 
paper, we present new techniques for solving special cases with blocks 
of restricted sizes and we obtain better approximation ratios for them. 
We first obtain results for small instances which are then used to solve 
the more general cases. Our main results are (i) a 4/3-approximation 
algorithm when the maximum block size h—2 (previous best was 3/2); 
and (ii) a 1.7-approximation algorithm for the case h—3 (previous best 
was lyj)- 



1 Introduction 

The dynamic storage allocation (DSA) problem is a classic combinatorial op- 
timization problem that has a wide variety of applications, including dynamic 
memory allocation in operating systems ([Robs74,Knut73,WJNB95]). 

Our study an offline DSA is motivated by the berth allocation problem (BAP) 
in busy container transshipment ports (such as Singapore) . Ships arriving at the 
port have fixed arrival and departure times and have to be berthed with a 
(straight-line) section of the port so that they can unload and load containers. 
Each vessel must be berth entirely within the section. Effective allocation of 
berthing space to the ships is an important planning task in many ports in order 
to increase throughput, reduce delay, and optimize the use of scarce resources 
(wharf areas, cranes) . 

We show that the BAP is precisely the offline DSA problem. This DSA prob- 
lem was first studied by Knuth, Robson and others [Robs74,Knut73] and has 

* This work was supported in part by the National University of Singapore under 
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recently received some renewed attention. The problem is NP-complete [GJ79] 
and there were a series of approximation algorithms proposed in recent years 
[Kier88,Kier91,Gerg96,Gerg99]. The best known result is a 3-approximation al- 
gorithm by Gergov [Gerg99]. Better ratios have been reported for special cases 
if restrictions are placed on the allowable sizes of the blocks [Gerg96,MuBh99]. 

In this paper, we present new techniques for solving more special cases (of 
blocks with restricted sizes) that produce better approximation ratios. Our mo- 
tivation for studying special case are twofold: Firstly, we hope that results of 
small special cases will lead to better approximation algorithms for these special 
cases. Secondly, our research [Li02] has shown that these approximation algo- 
rithms for special cases can be adapted to solve the general case and produce 
good results in practice for the berth allocation problem. 

2 Problem Formulation and Previous Results 

The offline DSA problem is formulated as follows^: An instance B of the offline 
DSA is given by a set of blocks B = {Bi,B 2 , ■ ■ ■ , B„}. Each block Bj (1 < j < n) 
is a triple of nonnegative integers {sj,aj, dj) where Sj is the size of the requested 
block while Oj and dj are the arrival time and departure time of Bj . A (memory) 
allocation b{B) = ( 6 i, 62 , • • • , &n) is a mapping that assigns to each block Bj a 
nonnegative integer location bj such that for all i,j, (1 < i < j < n), either 
[oi,di) n [aj,dj) = 0 or {bi,bi + Si) n {bj,bj + Sj) = 0. Note that in the online 
version of the DSA, we do not know B in advance. 

The DSA can be geometrically interpreted in 2-D (as illustrated in Figure 
1 ) where the a;-axis represents the time axis and the y-axis represents the linear 
storage. The block Bj is represented by a rectangle whose x-extend is the time 
interval [aj,dj) and the y-extend is the (contiguous) space allocation denoted 
by {bj,bj + Sj). The DSA problem is then to pack the collection of rectangles 
given by = {Bi , i? 2 , . . . , Bn} into a larger rectangle. Since the a;-extend of each 
block is fixed, the rectangles are only free to move vertically and the allocation 
function b merely determined their y positions. 
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Fig. 1. A DSA instance with two blocks. 



For any allocation b{B), we define the cost L{b) = max(i<j<„){ 6 j -I- s^}. 
Geometrically, we can interpret L{b) to be the minimum height of a rectangle 



^ Our notations are similar to those of [Gerg96]. 
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that contains all the allocated rectangles. We distinguish two versions of the 
DSA. In the optimization version of the DSA, we are given an instance B of the 
DSA and we want to find a minimum cost allocation, namely, an allocation b that 
minimizes L{b). In the decision version of the DSA, we are given an instance B 
and a cost threshold Lq and we want to find an allocation b{B) with L{b) < Lq. 

The berth allocation problem (BAP) is precisely the decision version of the 
offline DSA problem if we model each ship by a block Bj = (sj,aj,dj), where the 
size Sj corresponds to the length of the ship, while and dj correspond to the 
arrival time and departure time of the ship. The cost threshold Lq corresponds 
to the length of the section. Then the allocation bj corresponds to the allocation 
the ship to the location {bj,bj + Sj) along the section. An allocation b{B) with 
L{b) < Lq corresponds to a BAP solution where all the ships are berthed without 
overlap within a section of length Lq . 

We note that the DSA is analogous to a 2-D rectangle packing problem - 
that of packing the smaller rectangles into the bigger rectangle. In Figure 2, we 
illustrate this viewpoint. We refer to the larger rectangle as the packing area. 
Allocating a rectangle can also be viewed as “packing” the rectangle into the 
packing area. For the rest of the paper, we shall use the term “allocate” and 
“pack” interchangeably. 
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Fig. 2. The DSA as a 2D packing problem, (a) A solution with width W = 5, mt^ iJS) = 
2, mtj iJS) = 3, and m(B) = 4. (b) Partitioning the packing area into 5 tracks. 

A block Bj is said to be active at time t if t G [oj^dj), or equivalently, if 
Qj < t < dj. Let B{t) denote the set of blocks that are active at time t, namely, 
B{t) = {Bj : t G [aj,dj)}. Then, the local density at time t (denoted by mt{B)) 
is defined to be the total size of the active blocks at time t, namely, mt{B) = 
SBjGB(t) The density of the instance B (denoted by M{B)) is the maximum 
local density, namely, M{B) = maxt {mt{B)}. Given an instance B of the DSA, 
it is easy to see that L{b) > M{B) for any allocation b{B). 

The DSA problem is NP complete and there has been a number of approxi- 
mation algorithms proposed. This ratio has been steadily reduced from 0{ln h) 
[Robs74], and to 80 by Kierstead [Kier88]. Slusarek [Slus89], and independently, 
Kierstead [Kier91] reduced the ratio to 6. Subsequently, this ratio was reduced to 
5 by Gergov [Gerg96] and later to 3 [Gerg99] . The latter result is the best-known 
general result to date. 
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Definition 1. Let DSA{W, M, h) denote the class of DSA instances B where 

(1) the maximum block size is h (namely, for all j, ^ < sj < h), 

(2) the density of the instance is M (M{B) = M ), and 

(3) the cost threshold Lq = W. 

Definition 2. An instance B G DSA{W, M, h) is said to be feasible if there 
exists an allocation b{B) with L{b) < W. The class DSA{W,M,h) is said to be 
feasible if all instances in the class are feasible. 

Then, the optimization version of the DSA problem can be formulated as 
follows: Let £{M, h) denote the smallest W such that DSA(W, M, h) is feasible. 
For a fixed h, let C{h) = lirriM^oo ■ We have the following results: 

Theorem 1. [GJ79] C{M, 1) = M for all M > 1. Namely, £(!) = 1 

Theorem 2. [Gerg99] £{M, h) < 3M for all h> 1. Namely, £{h) < 3 

Results for several special cases have also been reported. For the DSA vari- 
ant with two distinct block sizes (say Si and S 2 ), Gergov [Gerg96] showed that 
£{M, {si, S 2 }) < 2M or £({si, S 2 }) < 2 For the case of blocks with sizes 1 and 
h, it was shown [MuBh99] that the First Fit (FF) algorithm has approximation 
ratio 2 — 1/h, namely, £({l,/i}) < (2 — l/h). Note that this implies £(2) < 1^. 

For the online DSA problem, we let ODSA{W, M, h) denote the class of 
instances in which all the blocks have sizes < h and the local density at any 
time t is bounded by M and Lq = W. Let Af{M, h) denote the smallest W such 
that ODSA(W, M, h) is feasible. For a fixed h, let Af{h) = limM^oo '^^M'^'^ ■ 
the general case, the best result, due to Robson [Robs74], is that 0.5Mlog2 h < 
N{M,h) < 0.84Mlog2/i. For small values of h, it has been shown that the 
following holds [Knut73,Robs74,LuN074]: 

Theorem 3. For the online DSA, we have 

(a) A/"(2) = 1.5, and more precisely, Af{M,2) = \{3M — l)/2], 

r&;i|<Af(3)<i^, 

(c)M{{l,h}) = 2-l/h. 

In this paper, we study DSA{W, M, h) and obtain values of £{M, h) for 
small M and h = 2 and 3 using new algorithms for solving these basic cases. We 
then present new approximation algorithms based on channel partitioning that 
achieves improved bounds for £{h), h = 2,3. Both of these bounds are superior 
to the corresponding best known value for Af{h) (the online case). For ft, = 2, it 
is known that £{ 2 ) > | while the best known upper bound is |, which is the 
same as N{2). We improve this to £{2) < |. For ft = 3, we show that £{3) < 1.7 
while the best known bound for N{3) is l|i. 



^ Here, we extend the notation DSA{W, M, ft) to DSA{W, M, S), where the S denote 
the set of allowable block sizes. We also extend the notation of £ accordingly. 
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3 Solving Some Small Basic Cases 

We first solve DSA{W, M, h) classes with some small M and ft, = 2, 3. In solving 
these cases, we develop algorithms that partition the packing area into two ad- 
jacent channels and use a channel flip procedure for effective allocation. These 
ideas are central to the approximation algorithms developed in the next section 
for £(2) and £(3). 

In our packing algorithm for small DSA instances, it is convenient to divide 
the packing area into horizontal tracks of unit length as illustrated in Figure 
2(b). For DSA(VF, M, ft), we have tracks 1, 2, . . . , IF, where track k refer to the 
horizontal strip whose j/-extend is (ft— 1, k). For a block Bj (with size Sj), setting 
bj = ft is equivalent to allocating the (contiguous) tracks fc-|-l, ft-l-2, . . . , k+Sj 
to Bj, or equivalently, the y-extend of Bj is (ft, ft -I- s^). For example, allocating 
a (unit size) block Bj to track 1 means bj = 0. 

3.1 Basic Cases for h — 2 

Theorem 4. For the offline DSA problem, we have 

(a)C{2,2) = 2, £(3,2) = 3, ('cj £(4, 2) = 5, (d) C{5,2) = 6. 

It is easily shown that DSA{2,2,2) is feasible using a First Fit algorithm. The 
algorithm for solving DSA(3,3,2) is a modified version of online first fit which 
we call First-Fit with Channel Flip (FF-CF). We first divide the 3 tracks into 
two channels Ci with 1 track and C 2 with two contiguous tracks. There are two 
possible configurations: (a) C'i={l}, C'2={2,3} or (b) C'i={3}, C2={1,2}. The 
actual configuration in use will vary during the allocation process as described 
below. 

The algorithm FF-CF processes the blocks in B in increasing order by their 
arrival times. The blocks are packed (allocated) using a modified first fit algo- 
rithm trying channel Ci first, and then C 2 . With loss of generality, assume that 
the configuration used is Ci={l}, C'2={2,3}. In a general step, we want to pack 
a block B that arrives at time t. We distinguish the following cases: 

Case 1: If B has size I, B is allocated to the first free track using first fit. 
Case 2(a): If B has size 2 and channel C 2 is free, we allocate B to C 2 (tracks 
2, 3). 

If B has size 2 and channel C 2 is not free, then C\ must be free and there is 
exactly one occupied track in C 2 (since local density at time t < 3). There are 
two sub-cases: 

Case 2(b): If the middle track (track 2) of C 2 is free, then both tracks 1 and 
2 are free. We perform a channel-flip operation at time t in which we change 
the channel configuration to C2={1,2} and Ci={3}. We then allocate block B 
to the new C 2 . 

Case 2(c): If the middle track (track 2) of C 2 is occupied, then the two free 
tracks (tracks 1 and 3) are “split” by the middle track. This is illustrated in 
Figure 3. In this case, we “exchange” the two tracks in channel C 2 - namely, 
exchange the allocation of all the size 1 blocks in C 2 that arrives after the most 
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recent size 2 block. After the exchange operation, tracks 1 and 2 are free at time 
t and this case reduces to Case 2(b) above. We perform a channel-flip operation 
at time t and allocate B to the new C 2 . 




Fig. 3. Example of track exchange and Channel Flip operations for DSA(3,3,2). (a) 
Channel Ci and Ci before block Bs is assigned; tracks 1 and 3 are free, (b) The 
configuration after exchanging the “active size 1 blocks” in C 2 (blocks Be and B 7 ), 
and performing a channel flip. The block Bs is assigned to the new C 2 . 



Algorithm 1: DSA-Alg(3,3,2,B): [FF-CF Algorithm] 

1. begin 

2. Sort the blocks in B by arrival time; 

3. Define channels Ci={l} and C2={2,3}; 

4. BS ^ 0, C2Start ^ 0 

5. for each block Bj £ B do 

6. if {sj = 1) then 

7. Pack Bj at the first free track using First Fit; 

8. if {Bj is packed into C 2 ) then FS ^ FS U {Bj} ; 

9. else / / {sj = 2) 

10. if (C 2 is free) 

11. then Pack Bj into C 2 ; 

12. else Channel-Flip(BS', Bj)-. 

13. C2Startv- dj-, BS ^ 0; 

14. end 

15. Procedure Channel-Flip(BS, Bj)-, 

16. begin 

17. if (middle track of C2 is occupied) 

18. then Exchange the track allocation of the blocks in FS on C 2 ; 

19. Flip the channel configuration of Ci and C 2 -, 

20. Pack Bj in the new C2-, 

21. end {Flip} 

The details of the algorithm is shown in Algorithm 1. Correctness of the 
algorithm is easily shown by proving the invariant that every size channel-flip 
operation generates a new empty C 2 channel that is used to pack a size 2 block. 
The running time for this algorithm is O(nlogn), since we need to sort all the 
blocks, and each block would get allocated once, and flipped at most once. 
Proof of Theorem 4(c): To prove Theorem 4(c), namely that £(4,2) = 5, 
we first prove that DSA(4,4,2) is infeasible. Figure 4 shows an instance of 
HS'A(4, 4, 2) that can be shown to be not feasible (using a case analysis). 
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Lemma 1. DS'A(4,4, 2) is not feasible. 

Corollary 1. DSA{M,M,h) is infeasible for all h>2, and M > 4. 
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Fig. 4. An infeasible D5'A(4, 4, 2) instance. A careful case analysis shows that this 
instance requires 5 tracks, namely IF = 5. It also gives a lower bound of | for C{2). 

Next, we show that DSA{5, 4, 2) is feasible by giving a constructive algorithm 
called First-Fit with Block Partitioning (FF-BP). Instead of partitioning the 
tracks, the algorithm partitions the blocks in B into disjoint subsets B\ and 
B 2 which are then separately packed. For 135^(5,4,2), the algorithm FF-BP 
partitions B so that the subset B\ has density 2 {m{Bi) < 2), and the subset 
B 2 has density 3 {m{Bi) < 3). Then the blocks in B\ are packed in 2 tracks 
using algorithm DSA(2,2,2) and the blocks in B 2 are packed in 3 tracks using 
algorithm DSA(3,3,2). 

Algorithm FF-BP goes as follows: blocks in B are processed in increasing 
arrival times. Whenever a blocks Bj arrives, it is appended to B\ if mt{Bi)-\-Sj < 

2. Otherwise, it is appended to B 2 - (Details are given in Algorithm 2.) 

The correctness of this procedure is shown as follows: First, m{Bi) < 2 by 
construction, and so we only need to show that m{B 2 ) < 3. Blocks of size 1 
are trivially placed. Consider a block Bj of size 2 arriving at time t. Then, 
at most two tracks are occupied at time t (since total density is 4), namely, 
mt{Bi) mt{B 2 ) < 2. If Bj cannot be appended to Bi, then mt{Bi) > I and so 
'mt{B 2 ) < 1. Thus, rrit{B 2 ) < 3 after the block Bj is appended to B 2 - 

Algorithm 2: DSA-Alg(5,4,2,5): [FF-BP Algorithm] 

1. begin 

2. Sort the blocks in B by arrival time; 

3. Bi <— 0; mi = 2; 

4. f?2 <— 0; m 2 = 3; 

5. for each block Bj do 

6. t ^ flii 

7. if (mt(Bi) -I- Sj < mi) 

8. then ^ U {Bj}; 

9. else B 2 ^ B 2 U {Bj}; 

10. Apply DSA-Alg(2, 2, 2. Bi); 

11. Apply DSA-Alg(3, 3, 2, B 2 ): 

12. end 

The algorithm FF-BP also solves DSA(6, 5, 2) by partitioning the blocks 
into two subsets Bi and B 2 having density 3 each. The proof is omitted here. 
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3.2 Basic Cases for h — 3 

Theorem 5. For the offline DSA problem, we have 
(a) C{3,3) = 3 and (h)C{A,3)=b. 

The proof of Theorem 5(a), namely, than £(3,3) = 3, is simple. We first assign 
all the blocks of size 3. Then between any two consecutive blocks of size 3, we 
have an instance of DSA{3,3,2) that is feasible. 

To prove Theorem 5(b), we first note that DS'A(4,4,3) is infeasible by Corol- 
lary 1. Next, we prove that DS'A(5,4,3) is feasible. The algorithm used is a gener- 
alization of the First Fit with Channel Flip procedure uses in DSA-Alg(3,3,2,;8). 
We have two channels C 2 with 2 tracks and C 3 with 3 tracks. Without loss of 
generality, we assume that C 2 = {1, 2} and C 3 = {3,4, 5}. 

Consider a block Bj arriving at time t. If Sj = 1, it is trivially assigned using 
first fit. If Sj = 2, if C 2 is free, Bj is assigned to C 2 otherwise, Bj is assigned to 
C 3 using DSA-Alg(3,3,2) If Sj = 3, and C 3 is free, Bj is assigned to C 3 . If C 3 
is not free, then C 2 must be free and there is exactly one occupied track. If track 

3 is free, we perform a channel flip at t so that the new C 2 = (4, 5} and the new 
C 3 = (1, 2, 3} is free and assign Bj to the new C 3 . (If track 3 is not free, we first 
invert the tracks in channel C 3 so that track 3 is now free.) 

4 Approximation Algorithms for Special Cases 

In this section, we show how the results for the small instances can be used to 
give improved approximation ratios, namely, C{h), for small h. 



4.1 Algorithms Based on Multi-level Block Partitioning 

We extend the block partitioning ideas used in DSA-Alg(5,4,2) to a multi-level 
block partitioning algorithm called FF-MLBP. We illustrate the algorithm with 
the case h = 2. The block Bj in B are considered in increasing arrival time. The 
algorithm proceeds in 2 phases as follows: 

Phase 1: Let ni = We define rii level-1 subsets denoted by B\, B\, ... , 

B\^, each with maximum density m\ = 3. Each block Bj is assigned using first 
fit to the first subset that can accomodate it. If Bj does not fit into any of these 
subsets, it is left unassigned (for Phase 2). (Note: We show later that all blocks 
of size 1 are assigned in Phase 1 and there are now only blocks of size 2 left.) 

Phase 2: Now, define ri 2 = |"^] level-2 subsets denoted by B^, B 2 , ... , B"^^, 
each with maximum density m 2 = 2. It is trivial to assign the remaining blocks 
using first fit to these subsets. 

Theorem 6. DSA{A/3M + A, M, 2) is feasible, namely, £(2) < |. 

® Note that this assignment to channel £3 may require an “internal” channel flip 
within the 3 tracks of £ 3 . 
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Proof. We first prove the correctness of the algorithm. It is easy to see that all 
blocks of size 1 are assigned in Phase 1 since there are rii = \^~\ subsets each 
with maximum density of 3. Hence, there are only blocks of size 2 left in Phase 2. 
Suppose that a block B of size 2 arriving at time t cannot be assigned to any one 
of the level-2 subsets (in Phase 2). This implies that at time t, each of these ri 2 
subsets must have density 2 (since there are no blocks of height 1). Furthermore, 
each of the level- 1 subsets must have density at least 2 at time t (since block B 
was not assigned in Phase 1). Thus, the total density at time t (excluding block 
B) is > 2m -I- 2ri2 = 2 [ + 2 [ > M. This contradicts the density bound 
M for all blocks. Thus, all the blocks are successfully assigned in Phase 2. 

We now consider the number of tracks needed. Each of the ni level-1 sub- 
sets can be solved using DSA(3,3,2) and each of the ri 2 level-2 subsets with 
DSA(2,2,2). Thus, the total number of tracks needed is 3ni -|-2n2 < + 4. □ 

This result improves the previous bound for £(2) from 3/2 (by [MuBh99]) to 
4/3. This multi-level block partitioning approach can be generalized for h = 3 
to give a bound of 11/6 (=1.833) for £(3) using a 3-phase block parititioning 
approach with ni = [ , ns = , and with mi = m2 = m3 = 3. 

However, a superior result is described in the next section. 



4.2 An 1.7- Approximation Algorithm for h — 3 

In this section, we will present a special multi-level block partitioning algorithm 
for h=3 that gives a bound of 1.7 for £(3). We first describe a customized block 
partitioning algorithm (called FF-BP4) that assigns blocks to a packing area 
with 4 tracks. We divide the packing area into two channels Cl and C2 each 
with 2 contiguous tracks, say Ci = {1, 2} and C 2 = {3,4}. A block of size 1 can 
be assigned to any free track. A block of size 2 can only be assigned to channel 
Cl or C 2 (but not to the tracks 2 and 3 even if they are free). A block of size 3 
can be assigned to tracks {1,2,3} or {2,3,4} if there is room (with one of the 
channels flipped if necessary) . Any block that can not be assigned in this way is 
left unassigned. 

Let Bi be the set of assigned blocks after running algorithm FF-BP4. We 
prove that mt{Bi) > 2 whenever a block B cannot be assigned at time t. If B 
is of size 1, then rrit{Bi) = 4.1i B is of size 2, then rrit{Bi) > 3 except for the 
case when tracks {2,3} are free in which case mt{Bi) > 2. If B is of size 3, then 
mt(^i) > 2, otherwise we always pack B successfully. 

We are now ready to present the multi-level block partitioning algorithm for 
h=3. The algorithm is the same as the 3-phase FF-MLBP except that we use 
different algorithms to assign blocks to the subsets. 

Phase 1: Define n\ = \ level-1 subsets, with mi = 4 and assign blocks using 
algorithm FF-BP4. After this phase, only blocks of sizes 2 and 3 remain. 

Phase 2: Define U 2 = \ ^~\ level-2 subsets, with m2 = 6 and assign blocks using 
the algorithm for DSA(6,6,{2,3}) from Theorem 6(d). All blocks of size 2 are 
assigned in this phase. 
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Phase 3: Define ns = [ level-3 subsets, with m3 = 3 and assign blocks using 
first fit. 

Theorem 7. DSA{1.7M + 13, M, 3) is feasible, namely, £(3) < 1.7. 

Proof. We first prove the correctness of the algorithm. Suppose that a block B 
of size 2, arriving at time t cannot be assigned in Phase 2. This implies that at 
time t, each of the ni level-1 subsets must have density at least 2, and each of 
the ri2 level-2 subsets must have density at least 5. Thus, the total density at 
time t (excluding block B) is > 2m -|- 5ri2 = 2 [ + 5 [ > M, which gives 
a contradiction. Thus, all blocks of size 2 are assigned in Phase 2. 

Now, suppose that a block B of size 3, arriving at time t cannot be assigned 
in Phase 3. This implies that at time t, each of the ni level-1 subsets must have 
density at least 2, each of the U 2 level-2 subsets must have density at least 4, and 
each of the ns level-3 subsets must have density of 3, Thus, the total density at 
time t (excluding block B) is > 2m -|-4n2-l-3n3 = 2 +4 -|-3 \^~\ > M, 

which gives a contradiction. Thus, all blocks of size 3 are also assigned and the 
algorithm is correct. 

The total number of tracks needed is 4m -I- 6n2 -I- 3ns < 1.7M -|- 13. □ 
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1 Introduction 

The fc-center problem is a well-known facility location problem and can be de- 
scribed as follows: Given a complete undirected graph G = (V,E), a metric 
d : V X V K+ and a positive integer k, we seek a subset C/ C U of at 
most k centers which minimizes the maximum distances from points in U to U . 
Formally, the objective function is given by: 

min£/cy,|£/|<fe max„gy minuet/ d{v, r). 

As a typical example, we may want to set up k service centers (e.g., police 
stations, fire stations, hospitals, polling centers) and minimize the maximum 
distances between each client and these centers. The problem is known to be 
AfT^-hard [2]. 

A factor p-approximation algorithm for a minimization problem is a poly- 
nomial time algorithm which guarantees a solution within at most p times the 
optimal cost. For the /c-center problem, Hochbaum and Shmoys presented a 
factor 2-approximation algorithm and proved that no factor better than 2 can 
be achieved unless V = NV [3]. Approximation algorithms for other fc-center 
problems where vertex costs are considered or when vertex weights are used 
have been extensively studied [9]. More recently, Bar-Ilan, Kortsarz and Peleg 
investigated an interesting generalization of capacitated fc-center problem where 
the number of clients for each center was restricted to a service capacity limit or 
maximum load [1] . Their work was improved in recent work by Khuller and Suss- 
mann [7]. On the other hand, to ensure backup centers are available for clients, 
Krumke developed a ’’fault tolerant” fc-center problem, where the objective was 
to minimize maximum distances as before, but where each client is required to 
be covered by a minimum number of centers [8]. Approximation algorithms for 
these problems were improved and extended in [6]. 

In these studies, no provision was made to ensure centers provide a minimum 
coverage of clients. In the fault tolerant problem, the client demand side of the 
problem is guaranteed coverage by a minimum number of centers (less than k), 
yet, on the supply side, there is no guarantee that each center services a minimum 
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number of clients. In realistic applications however, such coverage is a common 
requirement. For example, in planning the location of hospitals, it would be 
expected that each hospital services a minimum number of neighborhoods. This 
would impose a balanced workload among hospitals and allow for economies of 
scale. Moreover, in cases when each center is equipped to provide a variety of 
services, a spread of clients covered is more likely to benefit service providers 
and clients alike. For example, where warehouses stock a variety of products, it 
would be beneficial if each services a spread of customers whose demands are 
more likely to include the range of products available. In this work, we address 
these provisions by extending the basic /c-center problem to include a minimum 
coverage requirement. We allow coverage by different centers to overlap allowing 
clients to choose from a number of centers. In the problem, we minimize distances 
as in the basic /c-center problem and require that every vertex in V is covered 
by one of the at most k selected centers in U . Further, each center in U must 
cover at least q vertices in V, where g is a non-negative integer, at most as large 
as \V\, which defines the minimum coverage for each center. 

We call this a g-all-coverage /c-center problem, with an objective function 
given by: 



min£/cy,|(7|<fe max(max„gy miurec/ d{v, r), max^et/ dq{V, r)), 

where dq{V,r) is the distance to r from its closest vertex in V. Note that 
because r G its closest vertex is r itself. 

Further, two variations to this problem will be studied. The first is a q- 
coverage fc-center problem, for which only vertices inV — U are counted in the 
coverage of every center in U . Its objective function is: 

min(7cy|£/|<fe max(max„gy_(7 minuet/ d{v, r), max^ec/ dq{V - U, r)), 

where dq{V — U,r) is the distance of r from its g*'' closest vertex in V — U. 

The second is a g-coverage /c-supplier problem for which V is partitioned into 
two disjoint subsets: S, a supplier set, and D, a demand set. The problem is then 
to find a subset U of at most k centers in S to minimize distances where not 
only is every demand point in D covered by a center in U, but every center in 
U must cover at least q demands in D. Here, the objective function is: 

min(7cs,|(7|<fc max(max„gD minuet/ d{v, r), max^et/ dq{D, r)), 

where that dq{D,r) is the distance of r from its g**' closest demands in D. 

Additionally, these three problems can be generalized by the inclusion of 
vertex costs and vertex weights, as has been done for the basic fc-center problem. 
To include costs, we define a cost c{v) for each vertex v in V where we now 
require This cost generalization is useful, for example, in the 

case of building centers where the cost for centers can vary and when there is a 
limited budget as is the case in practice. 

To extend the problems by including weights, we take w{v) be the weight 
of each vertex v in V so that the weighted distance to vertex v from vertex 
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u in is w{u,v) = d{u,v) • w{v). For any vertex v € V and X C V , let 
Wq{X,v) to be the closest weighted distance of v from the vertices in X. 
With this, the three variants can be generalized to weighted models by replacing 
distances d and dq in the objective functions with the weighted distances w and 
Wq, respectively. Weighted distances can be useful, for example, when l/w{v) is 
modelled to be the response speed of the center at vertex v, which then makes 
w{u,v) = d{u,v).w{v) its response time. 

Finally, by considering both vertex costs and vertex weights, we study the 
most general extensions for the three new problems. 

Throughout this paper, OPT denotes the optimal value of the objective 
function. We assume that the complete graph G = (V,E) is directed, where 

V = {wi, ..., Vn\ and E = V x V = {ei, ..., Cm} where m = where each vertex 

V G V has a self-loop {v, v) G E with distance d{v, v) = 0. For each edge Cj G E, 
let d{ei) and w{ei) denote its distance and its weighted distance, respectively. A 
vertex v is said to dominate a vertex u, if and only if v is equivalent to u {v = u) 
or V is adjacent to u and we denote the number of vertices dominated by v in 
G by deg~^{v). For each vertex v in the undirected graph H , deg{v) is its degree, 
i.e. the number of adjacent edges including the possible self-loop (v,v), and for 
any undirected graph El, I{H) denotes a maximal independent set [2], in which 
no two different vertices share an edge in H and no vertex outside I{H) can be 
included while preserving its independence. 

We present approximation algorithms for the three problems considered in 
this paper and their generalizations. Our methods extend from the threshold 
technique used for the basic /c-center problem [4], and are designed to address 
the new minimum coverage constraints included. 

2 Main Results 

Our main results are summarized as follows. 





Basic 


Weights 


Costs 


Weights -1- Costs 


g- All-Coverage AT-center 


2+ 


3 


3* 


2/3 -k 1 


g-Coverage AT-center 


2t 


4 


4 


3/3 + 1 


g-Coverage Al-supplier 


3t 


3t 


3t 


2/3+1 



We use t to indicate the best possible approximation factors have been achieved, 
which are shown to be 2, 2 and 3 for the three problems, respectively, unless 
V = MV. These optimal results include the basic cases of all the three prob- 
lems considered, and the weight and the cost generalizations of the g-coverage 
fc-supplier problem. Moreover, for the weight and the cost generalizations of 
the other two problems, approximation algorithms are provided with constant 
factors, all of which are close to their best possible approximation factor of 2. 
Especially, for the cost generalization of the g-all-coverage fc-center problem indi- 
cated by I in the table, a 3-approximation algorithm is achieved which matches 
the best known approximation factor for the cost generalization of the classical 
fc-center problem [4] . Further to this, the approximation algorithms for the cost 
generalizations of the three problems can be extended to solve their weight plus 




352 



Andrew Lim et al. 



cost generalizations. Let [3 denote the ratio between the maximum value and 
the minimum value of weights. Their approximation factors are consistent with 
those of their cost generalizations, which hold when (3=1. 

3 qr- All-Coverage fc-Center Problems 

The following hardness result for the g-all-coverage /c-center problem can be 
proved by extending the reduction from the Domination Set problem [2] used 
for the classical /c-center problem [5]. Details of the proof are omitted here but 
described in [10], for the shortage of space. 

Theorem 1. Given any fixed non-negative integer q, there is no polynomial- 
time (2 — e)- approximation algorithm for the q- all- coverage k-center problem, 
unless I\fV = V . 

The best possible approximation factor of 2 can be achieved by Algorithm 1. 
We first sort edges in E by order of non-decreasing distances, i.e., d{ei) < d{e 2 ) < 
... < d{em). Let Gi = (V,Ei) where Ei = {ei,...,ei} for 1 < i < m. Thus, if 
Gi has a set U of at most k vertices that dominate all vertices in Gi, and each 
vertex of U dominates at least q vertices (including itself) in Gi, then U provides 
at most k centers to the problem with at most d{ei) distance. Let i* denote the 
smallest such index. So d{e*) = OPT is the optimal distance. 

To find a lower bound for OPT, construct an undirected graph Hi. Hi con- 
tains an edge {u, v) where u,v € V might be equal if and only if there exists a 
vertex r G V with deg~^{r) > q and both (u,r) and (v,r) are in Gi. It is clear 
that the self loop {v, v) remains in Hi for each v £ V, and that if {v, u) £ Gi 
then (v,u) £ Hi since {v,v) and {u,v) are in Gi. As any two vertices dominated 
by the same vertex in Gi are adjacent in Hi, Hi* satisfies the following: (1) for 
each vertex v £ V, deg{v) > q in Hi*, including its self-loop; (2) the size of its 
maximal independent set \I{Hi*)\ < k. Accordingly, suppose that the threshold 
j is the minimum index i leading Hi to satisfy the above two conditions, then 
we have j <i* , which gives d{ej) < OPT. 

Finally, selecting vertices in the maximal independent set Hj, we have \I{Hj)\ 
< k. So, centers in I{Hj) dominate all vertices of V in Hj, and each v £ I{Hj) 
dominates at least deg{v) > q vertices (including itself) of V in Hj. By the 
triangle inequalities, we know d{u,v) < 2d{ej) < 2 • OPT, for every (u,v) in 
Hj. So the set U gives at most k centers with at most 2 • OPT distance, which 
establishes the following theorem. 



Algorithm 1 (Basic g-All-Coverage fc-Center) 

1 : Sort edges so that d{ei ) < d(c 2 ) ^ • • • ^ d{em ) , and construct Hi , H 2 , ■■■, Hm ; 

2: Compute a maximal independent set, I{Hi), in each graph Hi, where 1 < i < m; 

3l Find threshold j, denoting the smallest index i, such that \I(Hi)\ < k, and for each v ^ V, 
ideg{v) > g in Hi', 

4: Return I{Hj). 
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Theorem 2. Algorithm 1 achieves an approximation factor of 2 for the q-all- 
coverage k-center problem. 

3.1 Any q with Weights 

For the weighted case of the g-all-coverage fc-center problem, a 3-approximation 
algorithm follows. Firstly, sort edges by nondecreasing weighted distances, i.e., 
w{ei) < 10 ( 62 ) < < w{em) and let Gi = (V,Ei) where Ei = {ei,...,ei}. 

The graph Hi for Gi contains an edge (u, v) where u,v € V might be equal if 
and only if there exists a vertex r which has both (u,r) and (v,r) in Gi, which 
implies w(u,r) < w(ei) and w(v,r) < w(ei). To bound the optimum weighted 
distance (OPT), find the minimum index i and threshold j so that the degree 
of each vertex in Hi is at least q and the size of its maximal independent set is 
V(Hi)\ < k. This ensures w(ej) < OPT. Finally, consider each vertex v G V. 
Among dA u G V with w(v,u) < w(ci), let gi(v) denote the vertex having 
the smallest weight, i.e., the least weighted neighbor of v in Gi. Shifting every 
V G I(Hj) to gj(v), we obtain the set U which guarantees an approximation 
factor of 3 given by the following theorem. 

Theorem 3. An approximation factor of 3 can he achieved polynomially for the 
weighted q- all- coverage k-center problem. 

Proof. Firstly, by \I(Hj)\ < k and U = {gj(v)\v G I(Hj)}, we have \U\ < k. 
Next, for any vertex u G V, there must exist v in I(Hj) with (u, v) in Hj, which 
gives a vertex r G V with both (u, r) and (v, r) in Gj. Hence, w(u, r) < w(ej) and 
w(v,r) < w(cj). Since w(gj(v)) < w(r) and w(v, gj(v)) < w(cj), u is covered by 
gj(v) G U within w(u, gj(v)) < (d(u,r) -\- d(v,r) -\- d(v, gj(v)))w(gj(v)) < 3w(cj). 
Furthermore, the degree of any vertex v G I(Hj) is at least q, which implies 
at least q vertices like u, equivalent or adjacent to v in Hj, can be covered by 
gj(v) G U within 3w(ej). Since w(cj) < OPT, the approximation factor is 3. □ 

3.2 Any q with Weights and Costs 

We now give a (2/3-1- l)-approximation algorithm for the most general case where 
vertices have both weights and costs. If only cost is considered, a 3-approximation 
can be achieved where /3 = 1. 

The algorithm here is similar to that for the q-all coverage fc-center problem, 
except that a new set Ui is constructed by shifting each v G I (Hi) to Si(v), where 
Si(v) is the vertex who has the lowest cost among dAu gV with w(v, u) < w(ei). 
Hence Si(v) is called the cheapest neighbor of v in Gi and we take Ui = {si(u) G 
I(Hi)} and c(Ui) to denote the total costs of vertices in Ui. Because no two 
vertices in I (Hi) are dominated by a common vertex in Gi, the index i* with 
w(ei») = OPT leads Hi* to satisfy the following: (1) for each vertex v G V, 
deg(v) > q in Hi*, including its self-loop; (2) c(Ui*) < k. 

Finding the threshold j to be the minimum index i which causes Hi to satisfy 
the above two conditions, we have j < i* and w(cj) < OPT. Furthermore, we 
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will prove that the Uj provides at most k cost centers ensuring an approximation 
factor of (2/3 + 1) in the following. 

Theorem 4. An approximation factor of (2/3+ 1) can he achieved polynomially 
for the weighted and cost q-all-coverage k-center problem. 

Proof. Because cfUj) < k and w{ej) < OPT have been shown, we need only 
show that the objective function distance given by Uj is at most (2/3+l)w(ej). On 
one hand, for any vertex u G V, there exists a vertex v G I{Hj) adjacent to u in 
p{j. This implies there is a vertex r G V with both w(u, r) < w(ej) and w{v, r) < 
w{ej). Since w{r) < f3w{sj{v)) and w{v, Sj{v)) < w{cj), we know that v is cov- 
ered by Sj{v) G Uj within w(u, Sj(z;)) < {d{u,r) + d{v,r) + d{v,Sj{v)))w{sj{v)) < 
(2/3 + l)w{ej). On the other hand, because the degree of any vertex u G I{Hj) 
is at least q, there are at least q vertices like u, equivalent or adjacent to v in 
Hj, covered by Sj{v) G Uj within at most (2/3+ l)w(ej) weighted distance. Since 
w{ej) < OPT, the approximation factor is (2/3 + 1). □ 

4 qr-Coverage fc-Center Problems 

Compared with the g-all-coverage fc-center problem, the g-coverage /c-center 
problem has an additional stipulation: for each selected center v, the at least 
q vertices covered by v should be outside the set of selected centers. 

To determine its hardness, we provide the following theorem, which can be 
shown by a modified reduction used for Theorem 1. Details of the proof are 
omitted here but described in [10], for the shortage of space. 

Theorem 5. Given any fixed non-negative integer q, there is no polynomial- 
time {2 — e)- approximation algorithm for the q-coverage k-center problem, unless 
NV = V. 

The best possible approximation factor of 2 can be achieved for the g-coverage 
fc-center problem in a similar way as the algorithm for the g-all-coverage /c-center 
problem. The only difference is that the threshold j, found here, must cause the 
degree deg{v) to be at least g + 1 in Hj instead of g for each vertex v GV, since 
self-loops might exist but each center should be adjacent to g vertices other than 
itself. The approximation factor of 2 is proved by the following theorem. 

Theorem 6. An approximation factor of 2 can he achieved polynomially for the 
q-coverage k-center problem. 

Proof. By the same analysis for Theorem 2, we know that d{ej) < OPT, and 
that I{Hj) provides at most k centers which cover all the vertices within at most 
2d{ej). Since each vertex v G I{Hj) is adjacent to at least g vertices other than 
itself in Hj, to prove its g-coverage within 2d(ej) we need only show that no two 
vertices in I{Hj) are adjacent to each other in Hj. This is obvious, since I{Hj) 
is an independent set of Hj. Since 2d{cj) < 2 ■ OPT, the approximation factor 
is 2. □ 
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Algorithm 2 (weighted g-Coverage /c-Center) 

1: Sort edges so that w{ei) < w{e 2 ) < ... < w{em)'i 
2: For each 1 < z < m, let — {v\deg'^ {v) > g + 1}; 

3l Construct graphs, Pi , P 2 1 

4: Compute a maximal independent set, I{Pi) for each graph Pi where 1 < z < rrz; 

5l Find the threshold j, denote the smallest index z, such that Qi dominates all vertices of V in 
Gi, and |/(Pi)| < k- 

61 For each vertex v e I{Pj), let p{v) denote the lowest weighted vertex among all vertices u G Qj 
with an edge (v,u) in Gj‘, 

7: Shift vertices in I{Pj) by U' — {p(z;)|z; G /(Pj)}; 

81 Construct H' — {U',E') from Gj, where for any two vertices u and v in I{Pj), an edge 
(p(iz) , p(u)) G E' if and only if either (p(z?) , p(iz)) or (p(zz), p(u)) is in Gj ; 

Call Algorithm 3 to obtain I(H'), a maximal independent set of H' , insuring that for any vertex 
u ^ U' — there exists a vertex v G with (zz, u) G E' and w{v) < zc(zz); 

10: Return I(H'). 



4.1 Any q with Weights 

The weighted case of the g-coverage fc-center problem can be solved by Algo- 
rithm 2, which is more intricate than previous algorithms and can be described 
as follows. 

First, after sorting the m edges, an undirected graph Pi, instead of Hi, is 
constructed from Gi for 1 < i < m. The construction is as follows. Let Qi be the 
subset of V G V with deg~^{v) > <7 -I- 1. For any u,v G V where u and v might 
be equal, an edge {u, v) is in Pi if and only if there exits r G Qi so that both 
(u,r) and (v,r) are in Gi. Consider the index i* where w{ei*) = OPT. Because 
each selected center must dominate at least q vertices other than itself, and no 
two vertices in I (Pi* ) are dominated by the same vertex in Gi* , we observe that: 
(1) each vertex of V is dominated by at least one vertex in Qi*; (2) the size of 
I (Pi*) must be at least as large as k, i.e. \I{Pi*)\ < k. Accordingly, define the 
threshold j to be the smallest index i, such that Qi dominates all vertices of V, 
and \I{Pi)\ < k. The two observations above imply w(ej) < OPT. 

Second, shift vertices in I{Pj) as follows. For each vertex v G I{Hj), let p{v) 
denote the smallest weighted vertex, among all u G Qj with an edge (v, u) in 
Gj. This gives U' = {p{v)\v G I{Pj)}. 

Now, consider an undirected graph, H' = ([/', E'), where, for any two vertices 
u and V in I{Pj), an edge {p{u),p{v)) G E' if and only if either {p{v),p{u)) or 
{p{u),p{v)) is in Gj. Its maximal independent set, denoted by I{H'), can be 
obtained greedily by Algorithm 3. It is easily seen that for any vertex u G 
U' — I{H'), there exists a vertex v G I{H') with (u,v) G E' and w{v) < w{u), 
where v could be the vertex that marks u in Algorithm 3. 

Now, we prove that I{H') provides at most k centers ensuring an 4-approxi- 
mation factor to establish the following theorem. 

Theorem 7. Algorithm 2 achieves an approximation factor of 4 for the weighted 
q-coverage k-center problem. 

Proof. Noting w{ej) < OPT and \I{H')\ < \U'\ < \I{Pj)\ < k, we need only 
prove the following two facts: (1) each p{v) G I{H') covers at least q vertices u G 
V — I{H') within w{u,p{v)) < 4w{ej), where v G I{Pj); (2) each u G V — I(H') 
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Algorithm 3 (Maximal Independent Set of = {U\E') with Weights w) 
1 : (7 ^ 0 ; 

2: while U' ^ ^ do 

3l Choose the vertex v, which has the smallest weight iu(n) among all u e U'\ 

4: U ^ C/-h {v} and U' ^ U' - {v}-, 

5l Mark all the vertices u ^ U' adjacent to v, i.e {u,v) e E' ^ by U' ^ U' — {'u-}; 

6l end while 

7l Return U as the maximal independent set of H' . 



is covered by a certain vertex p{v) G I{H') within w{u,p{v)) < 4w{ej), where 
V G I{Pj)- On one hand, consider each p{v) G where v G I{Pj)- Because 

I{H') C U' C Qj, we know p{v) G Qj, and so, there exist at least q vertices, 
other than p{v), which are dominated by p(v) in Gj. Moreover, each vertex u 
of these q vertices is not in I{H'), because otherwise, the edge (u,p{v)) in Gj 
implies an edge (u,p{v)) in H' , contradicting to the independence oiI{H'). Note 
that w{u,p{v)) < w{ej) < 4w{ej). The fact 1 is proved. 

On the other hand, consider each vertex u G V — Because I{Pj) 

is a maximal independent set of Pj, there exists a vertex ti G I{Pj) with 
an edge (u,ti) in Pj. (Note that if u is in I{Pj), a self loop (u,u) must be 
in Pj because all vertices of V are dominated by Qj). Thus, we know that 
p{ti) is in H' . Since I{H') is a maximal independent set of H', there exists 
a vertex ^(^2) G I{H') for t2 G I{Pj), with w{p(t2)) < w{p(ti)) and an edge 
{p{ti),p{t2)) in H'. So w{p{ti),p{t2)) < w{p{t2),p{ti)), implying {p{ti) , p{t2)) 
is in Gj. Because (u,ti) is in Pj, there exits a vertex a G Qj dominating 
both u and ti in Gj, leading w{p{ti)) < w{a). Noting that weighted distances 
of (u,a), (ti,a), (ti,p{ti)), and {p{ti),p{t2)) are all at most w{ej), we have 
w{u,p{t2)) < {d{u,a) + d{ti,a) + d{ti,p{ti)) + d{p{ti),p{t2)))w{p{t2)) <4w(ej). 
This proves the fact 2 and completes the proof. □ 

4.2 Any q with Weights and Costs 

The basic idea employed to solve this problem is to combine and modify algo- 
rithms for the g-all-coverage and the g-coverage fc-center problems. Here, we con- 
struct Hi, ..., Hra first and sort edges e\, ..., Cm by their nondecreasing weighted 
distances. 

However, to find the threshold j we need a new approach. For 1 < z < m, an 
undirected graph iJ' is generated in the following manner: For any two vertices 
u,v G V, the edge (u,v) is in Ft', if and only if there exists a vertex r G V, 
such that either (zt, r) is in Gi and (v,r) is in Hi, or (v,r) is in Gi and (u,r) 
is in Hi. We then compute I{H[), a maximal independent set of it', and shift 
each vertex v G I{H[) to its lowest cost neighbor Si{v) in Gp, this forms the set 
Ui = {s,{v)\vGl{H'i)}. 

Now, we find the threshold j, which is the minimal index i, giving deg(v) > 
q + 1 in Hi for each vertex v GV and c(C/') < k, where c(C/') denotes the total 
cost of vertices in C/'. Observing that iF' is a subgraph of Hi, we know I{H[) 
is also an independent set of Hi. By similar arguments for algorithms of the 
g-all-coverage and the g-coverage F-center problems, we derive w{ej) < OPT. 






fc-Center Problems with Minimum Coverage 



357 



Furthering, we can show that C/j gives at most k cost centers within at most 
(3/3 + 1) • OPT weighted distance. (See [10] for details.) 

Theorem 8. An approximation factor o/ 3/3 + 1 can be achieved polynomially 
for the weighted and cost q-coverage k-center problem. 

In addition, for the g-coverage /c-center problem with cost only, an approxi- 
mation factor of 4 can be obtained by /3 = 1. 

5 qr-Coverage fc-Supplier Problems 

The g-coverage fc-supplier problem partitions the vertex set V into the supplier 
set S and the demand set D that are disjoint. Hence, at most k centers need 
be selected from S', to minimize the distance within which all the vertices in set 
D are covered by centers each of which must cover at least q suppliers in H. In 
order to determine its hardness, we present the following theorem which can be 
proved by a reduction of Minimum Cover problem [2]. Details of the proof are 
omitted here but described in [10], for the shortage of space. 

Theorem 9. Given any fixed non-negative integer q, there is no polynomial- 
time {3 — s)- approximation algorithm for the q-coverage k-eenter problem, unless 

Afr = p. 

The best possible approximation factor of 3 can be achieved for the q-coverage 
fc-supplier problem, even for its weighted extension and its cost extension. In the 
rest of this section, we first provide the weighted case a 3-approximation algo- 
rithm that is applicable for the basic case by specifying w{u) = 1 for each supplier 
u € S. Then, we provide the weighted and cost case a (2/3 -I- l)-approximation 
algorithm that ensures a factor of 3 for the cost only case with /3 = 1. 

5.1 Any q with Weights 

The approximation approach follows. As before, edges are sorted non-decreas- 
ingly, i.e., w{ei) < w{e 2 ) < ... < w{em). We have subgraphs G\, G 2 , ■■■, Gm, 
where Gi = {V, Ei), V = S U D and Ei = {ei, ..., e^}. To obtain the threshold 
index, a new graph Li is constructed on the demand set D for each Gi as follows. 
For each two demands u,v € D where u may equal to v, an edge (u, v) is in Li if 
and only if there exists a supplier r G S with both (u, r) and (u, r) in Gi. Hence, 
self- loops of all the vertices in V are still in Li. Let I {Li) denote a maximal 
independent set of Li . We find j to be the threshold index, which is the smallest 
index i, with deg{v) > q in Li for v G D, and \I{Li) \ < k. Since i*, the edge index 
of the optimal solution satisfies the above two conditions, and no two demands 
in I {Li) have edges from the same supplier in Gi for 1 < i < m, we have j < i* 
leading to w{ej) < OPT. 

We shift each demand v G I{Lj) to its cheapest supplier gj{v) with the lowest 
weight among suppliers having an edge from v in Gj. This forms the center set 
U = {gj{v)\v G I{Lj)}, which provides at most k centers with at most a 3 • OPT 
weighted distance. To see this, we prove the following theorem. 
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Theorem 10. An approximation factor of 3 can he achieved polynomially for 
the weighted q-coverage k-center problem. 

Proof Note \U\ < \I{Lj)\ < k and w{ej) < OPT. To obtain the approximation 
factor of 3, we need only show the following two facts: (1) each demand u G D 
is covered by a vertex in U within at most 3w{ej); (2) each supplier gj{v) G U, 
where v G covers at least q demands in D within at most 3w{ej). See [10] 

for details. □ 



5.2 Any q with Weights and Costs 

An approximation factor of (2/3+1) can be obtained for the g-coverage fc-supplier 
problem with weights and costs. When /3 = 1, it ensures an approximation 
factor of 3 for the cost only case. Compared with the algorithm for the weighted 
g-coverage k supplier problem, it is changed as follows. 

After finding I {Li) for 1 < / < to, we shift each demand v G I {Li) to its 
cheapest supplier Si{v) with the lowest cost among all the suppliers having an 
edge from v in Gi. This forms Ui = {si(f)|u G I{Li)}. The threshold index j 
is the smallest index i, giving deg{v) > g in for u G U and the total cost of 
vertices in Ui, c{Ui), is at most k. Since no two demands in I{Li) have edges 
from the same supplier in Gi for 1 < / < to, we have w{cj) < OPT. 

Now, we can prove that Uj have at most k cost centers within at most 
(2/3 + 1) • OPT weighted distance to establish the following theorem, by similar 
arguments in Theorem 10. (See [10] for details.) 

Theorem 11. An approximation factor o/ (2/3+1) can he achieved polynomially 
for the weighted and cost q-coverage k-center problem. 

6 Conclusion 

We studied a new fc-center problem which ensures minimum coverage of clients by 
centers. It is motivated by the need to balance services provided by centers while 
allowing centers to be utilized fully. We considered three variants of the problem 
by examining their in-approximation hardness and approximation algorithms. 
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Abstract. A c.e. real x is Solovay reducible (S-reducible) to another 
c.e. real j/ if j/ is at least as difficult to be approximated as x. In this 
case, y is at least as random as x. Thus, the S-reducibility classihes rela- 
tive randomness of c.e. reals such that the c.e. random reals are complete 
in the class of c.e. reals under the S-reducibility. In this paper we investi- 
gate extensions of the S-reducibility outside the c.e. reals. We show that 
the straightforward extension does not behave satisfactorily. Then we 
introduce two new extensions which coincide with the S-reducibility on 
the c.e. reals and behave reasonably outside the c.e. reals. Both of these 
extensions imply the rH-reducibility of Downey, Hirschfeldt and LaForte 
[6]. At last we show that even under the rH-reducibility the computably 
approximable random reals cannot be characterized as complete elements 
of this reduction. 

1 Introduction 

According to Turing [13], a real number x G [0,1] is computable if it has a 
computable binary (or decimal) expansion, i.e., x = xa '■= Emga for 

a computable set A C N. Thus, the computable reals have the simplest binary 
expansions. On the other hand, the binary expansion of a random real should be 
the most complicated, i.e., random. Here the randomness of a binary sequence a 
can be described by the “typicalness” that it does not belong to any “small” class 
according to Martin-L6f [9], or equivalently, by “chaoticness” which means that 
the sequence is disordered and complex such that the simplest way to describe 
it is to write all its members. That is, H (a \ n) > n — c for some constant c 
and all n, where H{a [ n) is the prefix-free Kolmogorov complexity of the initial 
segment a [ n of length n (see, e.g. [5] for more details). 

Relative computability is studied by means of various reducibilities. Simi- 
larly, Solovay’s domination relation of [12] compares the relative randomness of 
the c.e. (computably enumerable) reals, where c.e. reals are limits of increasing 
computable sequences of rationals. According to Solovay [12], a c.e. real y domi- 
nates another c.e. real x (or more popularly, x is Solovay reducible or S-reducible 
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to y), if y is at least as difficult to be approximated as x. More precisely, we 
have the following definition, where CS{x) {lCS{x)) is the class of computable 
(increasing) sequences of rationals converging to x. 

Definition 1 (Solovay [12]). A c.e. real x is Solovay reducible to a c.e. real y 
{x <s y) if there are {xs) € TCS(x), (y^) G XCS(y) and a constant c such that 

(Vs G N) (a; - Xs < c{y - y*)) . (1) 

Solovay shows in [12] that S-reducibility has the following important Solovay 
property that, for any c.e. reals x and y 

x<sy^ (3c)(Vn)(iV(x \ n) < H{y \ n) + c). (2) 

In other words, if x <s then x cannot be not more random than y. Thus, 
Solovay reducibility classifies relative randomness of c.e. reals. Since Solovay 
reducibility is reflexive and transitive, the induced equivalent classes are defined 
as Solovay degrees (S-degrees). Downey, Hirschfeldt, and Nies [7] showed that 
the S-degrees of all c.e. reals form a dense distributive uppersemilattice, where 
the natural join operation is induced simply by addition. As expected, the least 
S-degree consists of all computable reals. More interestingly, the class of all 
c.e. random reals forms the largest S-degree among the c.e. S-degrees. This is 
proven by a series of works. At first, Chaitin [4] introduced 17 reals as halting 
probabilities of universal self-delimited Turing machines and showed that every 
17 real is c.e. random. Solovay [12] called a c.e. real 17-Iike if it is complete under 
the Solovay reducibility on the c.e. reals, i.e., every c.e. real is S-reducible to it. 
Then he showed that every 17 real is 17-Iike and every 17-Iike real is c.e. random. 
Next, Calude, Hertling, Khoussainov and Wang [3], showed that every 17-like 
real is actually an 17 real. Finally, Kugera and Slaman [8] closed this circle by 
showing that any c.e. random real is 17-like. Therefore, the notions 17 real, 17-like 
real and c.e. random real are equivalent. 

It is well known that Solovay reducibility is a good tool to study relative 
randomness of c.e. reals. Therefore, it is interesting how this reducibility can be 
extended to larger classes of reals. To this end, let’s first recall some extensions 
of the class of c.e. reals. 

As the limits of increasing computable sequences of rationals, c.e. reals are 
also called left computable because they are computably approximable from the 
left side on the real axes. Symmetrically, right computable reals are the lim- 
its of decreasing computable sequences of rationals. Left and right computable 
reals are called semi- computable. The classes of computable, left computable, 
right computable and semi-computable reals are denoted by EC,LC,RC and 
SC, respectively. A first interesting extension of c.e. reals is the class of weakly 
computable reals by Ambos-Spies, Weihrauch and Zheng [1]. A real x is weakly 
computable if it is the difference of two c.e. reals. Thus, weakly computable re- 
als are also called d-c.e (difference of c.e. reals). It is shown in [I], that x is 
weakly computable iff there is an (xg) G CS{x) such that ~ a^s-i-i| 

is bounded and the class of weakly computable reals (denoted by WC) is 
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the arithmetical closure of all c.e. reals. More generally, the limit of a com- 
putable sequence of rationals (without any restriction) is called computably ap- 
proximable (c.a., for short) and CA denotes the class of all c.a. reals. Then we 
have EC C LC C SC C WC C CA as shown in [1]. 

Another kind of extension of c.e. reals is motivated by the notion of monotone 
convergence of Calude and Hertling [2] and is introduced by the authors [10]. 
Let /i : N ^ Q be a function. A sequence (xg) G CS(x) converges to x h- 
monotonically if [a; — Xtj < h{s)\x — x.s\ for all s < t. A real x is called h- 
monotonically computable (/i-mc, for short) if there is an (xg) G CS{x) which 
converges to x h-monotonically. x is called monotonically computable (me for 
short) or oj -monotonically computable (w-mc, for short) if it is h-mc for a constant 
function /i = c or a computable function h, respectively. We denote by MC and 
w-MC the classes of me and uj-mc reals, respectively. It is shown in [11, 10] that 
SC C MC C w-MC C CA and MC C WC, while the classes WC and w-MC 
are incomparable. 

As the first extension of the S-reducibility, the S2-reducibility is obtained 
directly by replacing the increasing computable sequence in Definition 1 by a 
general computable sequence. This extends the S-reducibility to the class of 
c.a. reals. The S2-reducibility coincides with the S-reducibility on c.e. reals and 
works also satisfactorily on the class MC. However, it is quite ill-behaved outside 
the class MC (Some of these ill-behaviors of the straightforward extension of the 
S-reducibility outside the c.e. reals were also pointed out by Downey, Hirschfeldt 
and LaForte in [6]). For example, S2-reducibility is not transitive on WC and 
some weakly computable real is not above a computable real, which contradicts 
the intuition of a reducibility respecting relative randomness. 

As alternatives, Downey, Hirschfeldt and LaForte introduced in [6] two new 
reducibilities, the strong weak truth table reducibility (sw-reducibility) and the 
relative H reducibility (rH-reducibility) . Both are defined for all reals and have the 
Solovay property. Nevertheless, the sw-reducibility is not comparable with the S- 
reducibility on the c.e. reals (as shown in [6]) and there is no maximal sw-degree 
of c.e. reals as shown by Yu and Ding [14]. The rH-reducibility, on the other 
hand, has some nice properties. But it does not coincide with the S-reducibility 
on the c.e. reals. In this sense, both sw-reducibility and rH-reducibility are not 
really extensions of the S-reducibility. 

In this paper, two natural extensions of the S-reducibility are introduced 
which coincide with the S-reducibility on the c.e. reals and behave reasonably 
well outside the c.e. reals. The first one (the S2a-reducibility) works on the class 
CA which is obtained by introducing a small tolerance in the definition of S2- 
reducibility. Namely, instead of an inequation like \xg—x\ < c|j/s — ?/|, we require 
\xg—x\ < c{\yg — y\-\-2~‘^) in the reduction of a; to y. It is this small change which 
removes all ill-behaviors mentioned above. In addition, the S2a-reducibility is 
also robust in the sense that several different approaches lead to the same re- 
ducibility. The second one (the Sla-reducibility) is also a tolerance-included 
variation of another reduction Sl-reducibility. Both of them are defined on all 
reals. We will show that the Sla-reducibility coincide with the S2a-reducibility 
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on the c.a. reals and behave reasonably on the c.a. reals, while Sl-reducibility 
has some ill-behaviors already on the class WC. Thus, the Sla-reducibility is a 
natural extension of the S-reducibility to all reals such that its restrictions on 
LC and C A are just the S2a-reducibility and the S-reducibility, respectively. For 
this reason, we can call the Sla-reducibility (generalized) Solovay reducibility. 

Finally, we show that all reducibilities introduced in this paper imply the 
rH-reducibility hence have the Solovay property. However, as we show at last, 
even the rH-reducibility has also a shortcoming that there is no maximal element 
under the rH-reducibility among the c.a. reals. 



2 Straightforward Extension to CA Reals 

In this section we investigate the straightforward extension of the S-reducibility 
to CA and analyze why it is not a suitable reducibility to study relative random- 
ness. This extension is obtained directly by replacing the increasing computable 
sequences by a general computable sequences of rationals in Definition 1 . 

Definition 2. Let x,y be c.a. reals, x is S2-reducihle to y (x <| y) if there are 
(xs) G CS(x), (ys) G CS(y) and a constant c such that 

{ysGN){\x-Xs\<c\y-ys\). (3) 

Obviously, <| is reflexive. Actually, it is also transitive on MC. 

Theorem 1. S2-reducihility is transitive on the class MC. 

Proof. The result follows from the following claim whose proof is omitted here. 

Claim. If X <| y and y G MC, then for any computable sequence {ys) which 
converges to y ci-monotonically for some constant ci, there are a computable 
sequences (xg) and constants c which satisfy condition (3). 

Suppose now that x <| y, y <| 2 and z G MC. Let (zg) be a computable 
sequence of rationals which converges to z ci-monotonically for some constant 
Cl > 1. Then, by the above claim, there are (rtg) G CS{y) and a constant C 2 
which witness the reduction y <| z. Suppose furthermore that (a;^) G CS(x) 
and (yg) G CS(y) witness the reduction x <| y with a constant C 3 . Assume 
w.l.o.g. that z is irrational and any two elements of (zg) are different. Then 
define a computable function g by g{s) := min{t > s \ \ut — yt\ < \zg — Zt\\. 
This implies that \x - a:g(g)| < cs\y - < cs^y - Ug(g)| -f |Mg(g) - j/g(g)|) < 

C 3 {c 2 \z - Zg(g)| -I- |zg - Zg(g)l) < C 3 (C 2 -f l)/(ci - l)|z - Zg|. That is, X <1 z. 

Thus, the S2-reducibility is reflexive and transitive on MC and it induces a 
natural S2-degree structure. By the next result, the computable reals form the 
least S2-degree in this structure. 

Theorem 2. If x G EC and y G w-MC, then x <| y. 

Moreover, on c.e. reals, the S2-reducibility coincides with the S-reducibility. 
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Theorem 3. For any c.e. reals x and y, we have x <s y iff x <| y. 

Therefore, S2-reducibility is really an extension of the S-reducibility to MC. 
However, the next theorem shows that the S2-reducibility behaves not satisfac- 
torily outside the class MC. 

Theorem 4. 1. (fix G EC \ Q){3y G WC)(a: y); 

2. S2-reducibility is not transitive on WC. 

Proof. 1. For any irrational computable real x, choose a computable sequence 
(a;g) of rationals such that |a; — a;*] < 2“^ for all s. We will determine a y G WC 
such that X y. That is, y satisfies all the following requirements 



^Pi, ipj are total 

R(r,j,k) ■ linis^oo 'F^{s) = a: > ^ (3s)(|a; - <pi(s)| > k ■ \y - (fij(,s)\) 

lim^^oo <fj (s) = y J 

where (tpi) is an effective enumeration of all computable functions :C N ^ Q. 

To satisfy a single requirement Re for e = {i,j,k), we will define an open 
witness interval / such that every y G I satisfies Re- At the beginning, let 
I := (a;b) for some a,b gQ. If the sequence (y?j(s)) does not enter I, then we 
are done because lim^^oo Fj{s) y. Otherwise, suppose that G I for some 
t. Since tpi is total and x is irrational, we have >fi(t) x = lims^oo Xg- Choose 
an s large enough such that la;^ — >.pi{i)\ > 2“®+^. This implies that \ipi{t) — a;| > 
\LPi(f) — x s \ — \x — X s\ > 2“®. Let e := min{2“®/(/c + l),Lpj(f) — a, b— ipj(t)} and 
define J := — e; -I- e). Thus, J is a witness interval of Re- 

To satisfy all requirements simultaneously, we construct a sequence (Jg) of 
witness intervals by a standard finite injury construction such that R is a witness 
interval, le+i C R and limg^oo ({le) = 0 where 1{I) is the length of the interval 
I . In this case, the unique common real y of all intervals satisfies all requirements. 
Furthermore, in order to guarantee that y is weakly computable, the length 1{R) 
of le should be less than 2“^®. 

2. By item 1, let x G EC and z G WC such that x z. Let y be a rational 
number. Then we have obviously that x y and y z. 

The shortcomings of the S2-reducibility in Theorem 4 is fatal for a reducibility 
to classify relative randomness, because it is obvious that computable reals have 
the least, i.e., no randomness. Therefore, we have to look for other reducibilities 
if we hope to extend the S-reducibility to a more general class than MC. 
Remark: The S-reducibility can be characterized by several different ways. For 
example, x <s y iff for all (y^) G 2CS{y), there exists an (x^) G 2CS{x), a 
constant c and a computable function / such that x — Xg < c{y — yf{g)) for all 
s; and iff for all (xg) G 2CS{x) and (y^) G 2CS{y), there exists a computable 
function / and a constant c such that x — Xf(^g) < c(y — y/(s)) for all s, etc. The 
corresponding extensions of <s to CA can also be based on these characteri- 
zations. However, they are all stronger strictly than S2-reducibility and behave 
even worse. Therefore we do not discuss them here in detail. 
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3 The Reducibility on CA Reals with Tolerance 

The S2-reducibility introduced in the last section does not behave very well 
outside the class MC. A better one will be introduced in this section by in- 
troducing a tolerance in the definition of S2-reducibility. Notice that the S- 
reducibility can be defined equivalently as follows. For c.e. reals x and y, x <s y 
iff there are a constant c and (x^) G TCS{x) and (ys) G TCS{y) such that 
(Vs)(a; — Xs < c{y — ys + 2“^)). This leads to our following definition. 

Definition 3. For x,y G CA, x is S2a-reducible to y (denoted by x <|“ y) if 
there are (xg) G CS{x), (ys) G CS{y) and a constant c such that 

(Vs)(|a; - a;*| < c(|y - -h 2"®)). (4) 

Theorem 5. 1. The S2a-reducihility is reflexive and transitive on CA; 

2. {Wx,y G LC) (x <|“ y x <% y x <s y); 

3. (Vx, y) [x G EC Sz y € CA =;> x <|“ y) . 

Proof. 1. Refiexivity is straightforward. We show now the transitivity. Suppose 
that, with a common constant c, the computable sequences (xg) and (yg) witness 
the reduction a; <|“ y and the computable sequences (ug) and (zg) witness the 
reduction y <|“ z. Since both computable sequences (y^) and (u^) converge to 
y, we can define an increasing total computable function / : N ^ N such that 
!“/(«) ~ yf(s) \ — and s < f{s) for all s. Thus, we have 

k - Xf(^g) I < c (|y - y/(s) I -h < c (|y - u/(*) | -h - y/(*) | -h 2"^) 

< C (^C (^\z — Zf(g)\ +2 -I- 2 • 2 < c' (|z — Zy(s)| -I- 2 

for c' = maxjc^, 2c}. That is, x <|“ z. 

2. The implications <s=^<| and are straightforward. We prove now 

the implication 

Let X, y be non-computable c.e. reals such that x <|“ y. There are a constant 
c and computable sequences {xg) and (y^) of rationals which satisfy (4). The se- 
quences (xs) and (ys) are of course not necessarily monotone. However, because 
X is c.e. but not computable, there is an increasing computable function / such 
that {xf(g)) is an increasing computable sequence which converges to x. Cor- 
respondingly, the computable sequence (y/(s)) converges to a non-computable 
c.e. real y and hence there is a computable function g such that {ygf(g)) is an 
increasing computable sequence which converges to y. Obviously, the sequences 
(xgy(s)) and {ygf(g)) witness the reduction x y too. That is, we have 

a: - Xgf^g) < c{y - ygf(g) + = c{y - {ygf(g) - 

Since lims^oo(yg/(s) — 2“®^^®)) = y and ygf(g) — < y, there is an increas- 

ing computable function h such that (ug) is an increasing computable sequence 
converging to y where Vg := yhgf{s) — Accordingly, {ug) is an increasing 
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computable sequence converging to x for Us := Xhgf(s)- Thus, the sequences (us) 
and (ws) witness the reduction x <s y- 
3. is straightforward. 

Theorem 5 shows that the S2a-reducibility is a natural extension of the S- 
reducibility to CA. Besides, this reducibility is also robust under slight variations 
in the definition. As an example, we show one of such equivalent variation. 

Theorem 6. For c.a. reals x and y, x y if and only if for any (xg) G CS(x) 
and {ys) G CS{y), there are an increasing computable functions f and a constant 
c such that (Vs)(|a; — a;/(s)| < c{\y — yf(s)\ + 2“^). 

Proof. The “if” part is easy. We prove now the “only if” part. 

Suppose that x <|“ y and this reduction is witnessed by (ug) G CS(x) and 
(fs) G CS{y) together with a constant c > 1. For any (xg) G CS(x) and (yg) G 
CS{y) we define an increasing computable functions / inductively by /(—I) = 0 
and /(n + 1) := min{t > /(n) : \ut — xt\ < 2“^”+^) & \yt — vt \ < 2“("+^))} for 
all n. The function / is well defined because limg^oo Us = lims^oo Xg = x and 
limg^oo Vs = limg^oo ys = y- Since /(s) > s + 1, for all s, we have 

\x - Xf(^g)\ <\x- u/(s)| + \uf(^g) - Xf(^g)\ < c{\y - 

< c{\y - yfis)\ + \yfis) - vf(^g)\ + 2-(*+i)) + 2"(*+2) 

< c{\y - ?//(,) I + 2 -(^+ 2 ) + + 2 -(*+ 2 ) 

< c(|?/-2//(s)| +2"^)). 

Theorem 7. If y is random and x <|“ y, then x <| y. 

Proof. Suppose that x <|“ y and the computable sequences (a;*) and {yg) witness 
the reduction with a constant c. Since y is random, there is a constant Ci such 
that \y — yg\ > ci • 2“^ for all s. This implies that |x — a;g| < c{\y — yg\ + 2“®) < 
c(l + c\)\y — ys\ for all s. That is, x <| y. 

4 Reducibilities on All Reals 

In this section we extend the S2a-reducibility to the Sla-reducibility which works 
on all reals. We will see that also in this case the Sla-reducibility which involves 
the tolerance works more reasonably than the Sl-reducibility (the variation with- 
out tolerance). Furthermore, the restrictions of the Sla-reducibility to the classes 
CA and LC are just the S2a-reducibility and the S-reducibility, respectively. In 
addition, Sla-reducibility implies rH-reducibility. At last we show that, even 
under rH-reducibility, there is no greatest degree of the c.a. reals. 

Definition 4. Let x,y be reals, x is Sl-reducible to y (denoted by x <g y) if 
there exist a partial computable function / :C Q ^ Q and a constant c such 
that y is a limit point of dom(/) which satisfies 

(Vr G dom(/)) {\x - f{r)\ < c\y - r|) . (5) 
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Correspondingly, x is Sla-reducihle to y (x y), if (5) is replaced by 

(Vr G dom(/)) - /(r)| < c{\y - r\ + (6) 

where l{r) is the length of the rational r (as a binary word). 

Theorem 8. 1. Nx,y G K)(a: <% y x <q y Sz x <%°‘ y x y); 

2. {\/x,y G CA)(a: <g y x <% y t x <^g y x <|“ y). 

From Theorem 8, the reducibility <g (<s“) coincides with <| (<|“) on the 
class CA. In addition, the class CA is closed downward under SI- and Sla- 
reducibilities. By a finite injury priority construction similar to the proof of 
Theorem 4.1 we can show the following. 

Theorem 9. (Vx G EC \ Q){3y G WC)(a; y) 

Thus, the reducibility <5 is not suitable at least on the class WC. The 
Sla-reducibility, on the other hand, behave much better. 

Theorem 10. 1. (Va;,y G K)(a; G EC x y); 

2. The S 1 a-reducihility is reflexive and transitive. 

Proof. 1. Let a; be a computable real and (x^) a computable sequence of rationals 
such that \x — a;*] < for all s. Define /(r) := Xg for s = l{r). Then / is a 
computable function such that \x — /(r)| < 2“^^'’^ for any r G Q and hence / 
witnesses the reduction x <^g y with c := 1. 

2. We prove the transitivity only. Let / and g be computable functions which 
witness the reduction x <^g y and y z together with a common constant 
c. We construct a Turing machine M which computes a function h :C Q —>■ Q 
as follows: given any r G dom(g), M looks for the first ri G dom(/) such that 
|ri| > |r| and |ri — g{r)\ < 2“I'’L Then M outputs f{r\). 

It is not difficult to see that dom(h) = dom(g) and hence a; is a limit point 
of dom(/i). In addition, for any r G dom(/i), we have |a: — h{r)\ = |a: — /(?"i)| < 
c{\y — ri| -I- 2“l’’il) < c{\y — g{r)\ + |5(r) —ri\+ < c^Qz — r| -|- 3 • 2“l’’il) < 

c'{\z — r| -I- 2“l’’l). That is, x z. 

At last, we compare the Sla-reducibility with the rH-reducibility introduced 
by Downey, Hirschfeldt and LaForte [6]. Let’s recall the definition first. 

Definition 5 (Downey, Hirschfeldt and LaForte [6]). For reals x and y, x 

is rH-reducible to y (denoted x <th y) if there are a constant k and a computable 
function / such that 

(Vn G N)(3j < k) {f{y \ n,j) i= y \ n) . (7) 

It is shown in [6] that the S-reducibility is strictly stronger than the rH- 
reducibility and the rH-reducibility has the Solovay property. Now we show that 
the Sla-reducibility implies the rH-reducibility. 
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Theorem 11. (Vx,?/ G K)(a: y x <rH y)- 

Proof. Suppose that x y and, w.l.o.g. y is irrational. Let c be a constant 
and / :C Q ^ Q be a computable function which witness the reduction x y, 
that is, they satisfy condition (6). For any binary word a of length n, let 

5, := G {0, 1}" : (3r G dom(/)) (a C r & |/(r) - ^| < 2"("+2)) } . 

Obviously, So- can be computably enumerated uniformly on cr. That is, there is 
a computable function :C Q x N ^ Q such that (p{a, 0),ip{a, 1), (p{a, 2), • • • are 
one-one enumerations of So- for all cr. 

Let be a natural number such that c < 2^ and a = y \ n for some n. Now 
we show that |<S'cr| < Suppose that ti,T 2 G dom(/) are binary words such 

that cr C n, cr C T 2 and ti \ (n-|-fV-|-3) = T 2 \ (n+N+3) = y \ (n-|-N-|-3), then 
we have |/(ri) - /(t 2 )| < |/(n) - a;| -b |/(t 2 ) - x| < c(|n - y| -b |t 2 - y| -b 
2 ~H'^ 2 )'j < 2^ • 4 • Let U\,U 2 G {0, 1}" be binary words of 

Sa corresponding to n and T 2 , respectively. Then we have |/(ti) — ui | < 
and |/(t 2 ) — U 2 I < 2“(”+^\ and hence \u\ — U 2 I < |mi — /(n)| + |u 2 — /(t 2 )| -b 
l/(n) - f{T 2 )\ < 2-("+2) -b2-(”+2) -b2-("+i) = 2-". This implies that m = U 2 
because the length of ui and U 2 are n. On the other hand, there are only 
different binary words of length n -b fV -b 3 which extend the word cr of length n. 
Therefore, jS'^l < 2^+^ holds. 

Finally, since y is a limit point of dom(/) and irrational, for any n and 
a = y \ n, there exists a r G dom(/) such that y \ (n + N + 3) E t , i.e., |y — r| < 
2-{n+N+3) ^ This implies that |x — /(t)| < 2“^”+^) and hence x \ n = /(r) \ n. 
Let u := /(t) f n. Then |/(r) — u| < |/(r) — a;|-b|a; — u| < 2“(”+^). That is, u is an 
element of and hence there is a j < 2-^+^ such that ip(y \ n,j) = u = x f n. 
That is, the computable function (p and constant 2-^+^ witness the reduction 

^ —rH y- 

Corollary 1. Reducihilities and <g“ have the Solovay property. 

In summary, the Sla-reducibility seems to be a quite natural reduction to 
study relative randomness. Its restrictions on the classes CA and LC are just the 
S2a-reducibility and the Solovay reducibility, respectively. The rH-reducibility is 
a more general and weaker reduction than Sla-reducibility. The next theorem 
shows that even the rH-reducibility does not have a greatest degree of c.a. reals 
and hence the c.a. randomness cannot be captured by rH-completeness. 

Theorem 12. (Vy G CA)(3a; G CA)(a; ^rH y)- 

Proof. Let y be a c.a. real number and (y^) a computable sequence of rationals 
which converges to y. We construct a computable sequence (x^) of rationals 
which converges to x such that x y- That is, they satisfy the following 
requirements 



: (3n G N)(Vfc < j) (y?*(y \ n,k) ^ x \ n) 
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where (tpi) is an effective enumeration of all computable functions (pi :C {0, 1}* x 

{ 0 , 1 }*. 

To satisfy all requirements simultaneously, we define a sequence (rig) by 
n_i := 0 and Ue+i ■= rig + 7T2(e) + 2 for all e first, where 7T2 is the second 
inverse function of the pairing function (•, •). That is, 7Te((z,j)) = j. At any 
stage s, if Pi^siUs \ ne,k) is defined for some k < j, then we define a binary 
words Xg in such a way that 

1. Xs-i r Ue-i = X.S \ rie-i; and 

2- Pt,s{ys \ ne,k)[ne-i + k] ^ Xs[ne-i + k]. 

Thus, (xs) is a computable sequence of rationals which converges to a real num- 
ber X which satisfies all requirements Rg. That is, x ^rH V- 
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Abstract. We study the computational complexity of counting the num- 
ber of solutions to systems of equations over a fixed finite semigroup. 
We show that if the semigroup is a group, the problem is tractable if 
the group is Abelian and #P-complete otherwise. If the semigroup is a 
monoid (that is not a group) the problem is ^P-complete. In the case of 
semigroups where all elements have divisors we show that the problem is 
tractable if the semigroup is a direct product of an Abelian group and a 
rectangular band, and ^P-complete otherwise. The class of semigroups 
where all elements have divisors contains most of the interesting semi- 
groups e.g. regular semigroups. These results are proved by the use of 
powerful techniques from universal algebra. 



1 Introduction 

The computational complexity of deciding whether systems of equations over a 
fixed finite semigroup are solvable has been intensively studied in the past. In 
[3], it is proved that when the semigroup is a group, the problem is tractable if 
the group is Abelian and NP-complete otherwise. This line of research continued 
in [6,7], where the corresponding problem for finite monoids was given a com- 
plete solution. Some partial results in the general case of finite semigroups have 
been proved in [7]. Note that even the restricted problem of determining the 
computational complexity of solving systems of equations over a fixed regular 
semigroup is still open. 

In this paper we study the computational complexity of counting solutions 
to systems of equations over a fixed finite semigroup. This problem has not been 
given the same attention as the corresponding decision problem. A system of 
equations over a fixed finite semigroup (S', •) is a collection of equations of the 
form w\ • W 2 • ■ ■ ■ • Wk = yi • V 2 ■ ■ ■ ■ ■ Vj where each Wi and yi is either a variable 
or a constant in S. We denote the problem of counting the number of solutions 
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to systems of equations over the semigroup (S,-) by ^EQNg. The notation 
^EQNg is used in order to comply with the notation in [3,6,7]. The presence 
of the * in ^EQNg is motivated by the need to distinguish this problem from 
the problem of counting the number of solutions to a single equation over (S,-) 
(denoted by ^EQNs). The complexity of ^EQNg is measured in the size of 
the systems of equations (the size of {S, •) is fixed and does not matter). 

We prove that ^EQNq is tractable when (G, •) is an Abelian group and #P- 
complete when (G, •) is a non-Abelian group. In the case of monoids, ^EQN’^ 
is ^P-complete when (M, •) is a monoid that is not a group. Given a semigroup 
(S', •), we say that an element a in S is divisible if and only if there exists 6, c in S 
such that 6 • c = a. In the case of semigroups where all elements are divisible we 
show that ^EQNg is tractable if (S, •) is a direct product of an Abelian group 
and a rectangular band, and #P-complete otherwise. The class of semigroups 
where all elements have divisors contains most of the interesting semigroups, in 
particular regular semigroups. Moreover we prove that if (S', •) is a semigroup 
with zero, then ^EQNg is tractable if (S, •) is such that for all elements x, y in S, 
x-y = 0, and ^P-complete otherwise. Important to note is that all the conditions 
we impose on our semigroups to ensure the tractability (or ^P-completeness) of 
^EQNg can be checked in polynomial time (in the size of the semigroups). 

Our approach for obtaining dichotomy theorems for ^EQNg relies heavily 
on the use of powerful algebraic techniques that have previously shown to be 
very useful in proving dichotomy theorems for Constraint Satisfaction Problems 
(CSPs) [1,2]. It is our belief that similar algebraic techniques can also be used 
in a unified approach to determine the complexity of the decision problem. 

The paper is organized as follows. In Section 2 we give the basic definitions 
and results needed in the rest of this paper. Section 3 contains all our tractability 
proofs for ^EQNg. In Section 4 we prove the dichotomy for the computational 
complexity of ^EQNg when (S', •) is a group or monoid. In Section 5 we prove 
the dichotomy for the computational complexity of ^EQNg when (S, •) is a 
semigroup with zero or is in the class of semigroups where all elements are 
divisible. 

2 Preliminaries 

2.1 CSPs 

In this section we introduce the notation and basic results on CSPs that we will 
use in the rest of this paper. 

Let D be a finite set. The set of all n-tuples of elements of D is denoted by 
D". Any subset of Z?” is called an n-ary relation on D. The set of all finitary 
relations over D is denoted hy Rd- 

Definition 1. A constraint language over a finite set, D, is an arbitrary set 
E Q Rd- 

Definition 2. The counting constraint satisfaction problem over the constraint 
language E , denoted fj^CSP{E), is defined to be the counting problem with in- 
stance {V,D,C), where 
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— V is a finite set of variables, 

— D is a set of values (sometimes called a domain), and 

— C is a set of constraints {Ci, . . . , Cg}, in which each constraint Ci is a pair 
(sj, Qi) with Si a list of variables of length rrii, called the constraint scope, and 
Qi an rrii-ary relation over the set D, belonging to F, called the constraint 
relation. 

We ask for the number of solutions to (V,D,C), that is, the number of functions 
from V to D such that, for each constraint in C , the image of the constraint 
scope is a member of the constraint relation. 

The corresponding decision problem where we are interested in the existence of 
a solution is denoted CSP{F). In the context of this paper D is the elements of a 
finite semigroup ( 5 , •) and F is the set of relations expressible as equations over 
{S, •) (denoted by Fs). For example the equation x-y = c where x, y are variables 
and c is a constant gives rise to the following relation R = {{x,y)\x • y = c}. 

Next we consider operations on the domain D. Any operation on D can be 
extended in a standard way to an operation on tuples over D, as follows. 

Definition 3 . Let f be a k-ary operation on D and let R be an n-ary re- 
lation over D. For any collection of k tuples, ti,t2,-..,tk G R, the n-tuple 
f{ti,t2, . . . ,tk) is defined as follows: f{ti,t2, . . . ,tk) = 



{f{ti[l],t2[l],...,tk[l]),f{ti[2],t2[2],...,tk[2]),...,f{ti[n],t2[n],...,tk[n])) 



where tfii] is the i:th component in tuple tj. 

A technique that has shown to be useful in determining the computational com- 
plexity of CSP{F) is that of investigating whether F is closed under certain 
families of operations [ 5 ] . 

Definition 4 . For any constraint relation Qi € F if f is an operation such that 
for all ti,t2, . . . ,tk € Qi.' f{t\,t2, . . . ,tk) G Qi, then Qi is closed under f. If 
all constraint relations in F are closed under f then F is closed under f. An 
operation f such that F is closed under f is called a polymorphism of F. The 
set of all polymorphisms of F is denoted Pol{F). 

In this paper we will only be concerned with a particular family of ternary 
operations called Mal’tsev operations. An operation M is a Mal’tsev operation 
if and only if for all x, y, M {x, y,y) = M(y, y, x) = x. 

The following result provides us with the key to prove the #P-completeness 
of constraint languages. 



Theorem 1 ([ 2 ]). If F is a finite constraint language that is closed under no 
Mal’tsev operation then ffCSP{F) is #P-complete. 
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Throughout this paper we will use the following notation for the application of 
an arbitrary Mal’tsev operation M on tuples in a relation R. 

(a, b) e R 
{b,b) G R 
{b, a) G i? 

M 

{a, a) 

In the example above {a,b),{b,b) and (6, a) are three tuples in a relation R. If 
(a, a) ^ R we have shown that R is closed under no Mal’tsev operation, since 
(M{a,b,b), M{b,b,a)) = {a, a) for any Mal’tsev operation M. 

All of our ^P-completeness proofs for ^EQNg follows this outline: First we 
choose a relation R in Rs i.e., R is expressed by a specific equation E over {S, ■). 
Then we show that R is closed under no Mal’tsev operation, thus proving the 
#P-completeness of =ffCSP{Es)- It should be clear that this implies the #P- 
completeness of ^EQNg. It is also easy to realize that any system of equations 
over a finite semigroup (5, •) can be (parsimoniously) reduced in polynomial 
time to a system of equations over (S', •) such that no equation in this system of 
equations contains more than three variables. Equations containing more that 
three variables can be split into smaller parts by introducing new variables. 
Hence we can safely assume that all constraint languages we consider are finite. 

The following theorem is very useful when it comes to proving tractability 
results for #C'SP(T). 

Theorem 2 ([2]). A Mal’tsev operation M{x,y,z) = x * y~^ * z where * and 
are the operations of an Abelian group is called an affine operation. If M G 
Pol{r) then ffCSP{P) is tractable. 

2.2 Definitions and Results from Semigroup Theory 

For the convenience of the reader we here give the definitions and results from 
semigroup theory that we use in the rest of this paper. The proofs are available 
in (or easily deducible from) any standard reference on semigroup theory such 
as [4]. 

A semigroup {S, •) is a set S with an associative binary operation •. A semi- 
group with zero is a semigroup that contains an element 0 such that for all x in 
S', 0-a: = a;-0 = 0. A monoid is a semigroup that has an identity element. A 
band is a semigroup where all elements are idempotent (that is for all x in S, 
X ■ x = x). A rectangular band is a band where for all a, 6 in S, a • 6 • a = a. A 
regular semigroup is a semigroup where for all a in S there exists an element b 
in S such that a ■ b ■ a = a. A right zero semigroup is a semigroup where for all 
a,b in S, a ■ b = b {left zero semigroups are defined in an analogous way). If two 
semigroups (S, •«) and {T,-t) are isomorphic we denote this by (S, •«) = {T,-t). 
If (S, -s) and (T, •() are semigroups, then the Cartesian product (S, •«) x (T, -t) 
becomes a semigroup if we define {s,t) ■ {s' ,t') = (s s',t 'tt'). We refer to this 
semigroup as the direct product of (S', -s) and (T, -t). Let Div{S) denote the set 
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of divisible elements in S where (S, •) is a finite semigroup (a is divisible if and 
only if there exists 5, c in S' such that b • c = a). 

Theorem 3. If (S, •) is a finite semigroup and a an arbitrary element in S, then 
o'" is idempotent for some m. 

Theorem 4. If (M, •) is a finite monoid with the identity element as the only 
idempotent then (M, •) is a group. 

Theorem 5. Let (S, •) be a rectangular band, then there exist a left zero semi- 
group (L, •/) and a right zero semigroup {R, >) such that (S, •) = {L, •;) x {R, >). 



The following theorem is folklore. 

Theorem 6. A regular semigroup (S, •) whose idempotents form a rectangular 
band is isomorphic to a direct product of a group and a rectangular band. 

We give some hints on how to prove this theorem. By using Green’s relations 
it can be proved that (S, •) is a completely simple semigroup, and since (S, •) 
is orthodox it follows from the Rees-Suschkewitch Theorem [4] that (S, •) is a 
direct product of a group and the rectangular band of its idempotents. 

3 Tractability Results 

Due to the similar structure of the tractability proofs for the tractable cases of 
ffEQNg we have gathered all of these proofs in this section to make a unified 
approach possible. In the rest of this section assume that all equations in the 
systems of equations are of the following two types: equations only containing 
variables, x\ • . . . ■ Xn = x\ ■ . . . • Xm or equations with a single variable on the 
left hand side and a constant on the right hand side, x = a. We can make this 
assumption because it is easy to reduce (parsimoniously and in polynomial time) 
any system of equations over a finite semigroup into a system of equations of 
this form. 

Let A be finite set and let / : A” ^ A and g : A™ ^ A be two operations. 
If /(a, . . . , a) = a for all a G 7l, then / is idempotent. We say that g commutes 
with / if for all an, ... , aim, ■■■, a„m G A, 

g{f{aii , . . . , flni), . . . , / (aim, . . . , Onm)) = /(fl(aii, . . . , aim), . . . , p(a nl , . . . , anm)) 

The following lemma is the foundation for our tractability results. 

Lemma 1. Let (S', •) be a finite semigroup. If f : ^ S is an idempotent 

operation and ■ commutes with f, then f G PofiRs). 

Proof. Arbitrarily choose a relation R G Ps. If i? is a unary relation (i.e. R{x) 
holds if and only if a; = a for some a G S), then R is closed under / since / is 
idempotent. Otherwise, R = {(xi , . . . ,Xn,xi, . . . , Xm) \ xi - . . .-Xn = xi- . . .■ Xm} 
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and we arbitrarily choose k tuples ti,.. . ,tk in R. Now, R is closed under / if 
the following holds: 

n n+m 

Y[.f{ti\i],...,tk\i]) = f{ti[i],...,tk[i]) (*) 

i—1 

We know that • commutes with / so 

n / n n 

and 

n+m / n+m n+m 

n = / [ n n 

\z=n+l 

Since ti, . . . ,tk are tuples in R, it follows that nr=+.w = n”i++.wforaiij 
and (*) holds. □ 

Lemma 2. Let (S', •) be a finite semigroup. If f is an affine operation and • 
commutes with f , then ffEQNg is tractable. 

Proof. We see that / is idempotent since f{a,a,a) = a * a~^ * a = a so 
f G Pol{Ps) by Lemma 1. Consequently, Theorem 2 implies that ffEQNg is 
tractable. □ 

Our tractability results for ffEQNg are contained in the following theorem. 

Theorem 7. ffEQNg is tractable when 

1. (S, •) is an Abelian group. 

2. (S, •) is a left zero or right zero semigroup. 

3. (S, •) is a semigroup with zero such that for all x,y in S, x ■ y = 0. 

4 . (S, •) = (T, *) X (T', *') and ffEQNf., ffEQNf., are tractable. 

Proof If (S, •) is an Abelian group, then let M {x, y, z) = x-y~^-z. We show that • 
commutes with M and tractability follows from Lemma 2. Let a, b, c be arbitrary 
tuples in S^, then M(a[l], 6[1], c[l]) • M(a[2], 6[2], c[2]) = a[l] • • c[l] • a[2] • 

6[2]-i • c[2] = a[l] • o[2] • (6[1] • 6[2])-i • c[l] • c[2] = M(a[l] • a[2], 6[1] • 6[2], c[l] • c[2]). 

If (S, •) is a right-zero group (the proof for left-zero groups is analogous), 
then let M{x, y,z) = x* y~^ * z where the operation * is chosen such that (S, *) 
is an Abelian group (such a choice is always possible). Let a,b,c be arbitrary 
tuples in S'^, then M(a[l], 6[1], c[l]) • M(a[2], 6[2], c[2]) = M(a[2], 5[2], c[2]) = 
M(a[l] • a[2],6[l] • 6[2],c[l] • c[2]) and tractability follows from Lemma 2. 

If (S', •) is a semigroup with zero such that for all a;, y in S, a; • y = 0, then 
let M{x, y,z) = x * y~^ * z where the operation * is chosen such that (S, *) is 
an Abelian group. Let a, 6, c be arbitrary tuples in S^, then M(a[l], 6[1], c[l]) • 




376 



Gustav Nordh and Peter Jonsson 



M(a[2], 6[2], c[2]) = 0 = M(o[l] •a[2],5[l] •6[2],c[l] • c[2]) and tractability follows 
from Lemma 2. 

If (S', •) = (T, *) X (T', *') , then a system of equations over (S, •) can be 
split into two independent systems of equations, one over (T, *) and one over 
{T',*'). The number of solutions to the system of equations over (S, •) equals 
the product of the number of solutions to the systems of equations over (T, *) 
and (T'j *') respectively. Hence if ^EQN^ and ^EQN^, are tractable, then so 
is #EQN^. □ 

4 Finite Groups and Monoids 

In this section we prove our dichotomies for the complexity of the counting prob- 
lem over finite groups and monoids. Note that the decision dichotomy proved in 
[6] and the counting dichotomy we prove differs in the case of monoids, for ex- 
ample it is tractable to decide whether systems of equations over a commutative 
band are solvable, but to count the number of solutions is #P-complete. 

Theorem 8. Let (G, •) be a finite group, then ffEQNf, is tractable if (G, •) is 
Abelian, and #P-complete otherwise. 

Proof. We begin by proving the #P-completeness of ffEQNf, when (G, •) is 
a non-Abelian group. Because (G, •) is a non-Abelian group there exists two 
elements a, 5 in G such that a ■ b ^ b ■ a. Consider the following relation R = 
{(a;, y)\x-y = y-x}. It is easy to see that R is closed under no Mal’tsev operation. 

(a, 1) G i? a • 1 = 1 • a 
(l,l)Gi? 1-1 = 1-1 
(1,6) Gi? 1-6 = 6-1 

M 

(a,b) ^ R a-b b ■ a 

Hence ffEQNf, is #P-complete for non-Abelian groups. The tractability part is 
proved in Theorem 7. □ 

Next we consider the case where the semigroup is a finite monoid. 

Theorem 9. Let (M, •) be a finite monoid that is not a group, then ffEQNf^ 
is #P-complete. 

Proof. According to Theorem 4 we know that there exist an element a in M 
such that a yf 1 and aa = a. Consider the relation R = {{x,y)\x ■ y = a}. It is 
easy to see that R is closed under no Mal’tsev operation. 

(1, a) G i? 1 • a = a 

(a, a) G i? a ■ a = a 

(o, 1) G i? a • 1 = a 

M 

(1,1) i R 1 • 1 = 1 yf a 

Thus, ffEQNf^ is #P-complete. □ 
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5 Finite Semigroups 

The computational complexity of deciding whether systems of equations over a 
fixed finite semigroup are solvable has been studied in [7], where some evidence 
is given that a dichotomy in the decision case would be very hard to prove. Even 
the restricted problem of proving a dichotomy for regular semigroups is open. 
We prove that for semigroups where all elements have divisors the counting 
problem is tractable if the semigroup is a direct product of an Abelian group 
and a rectangular band, and #P-complete otherwise. Moreover we prove that 
for semigroups with zero, the problem is tractable if the semigroup is such that 
for all elements x,y x • y = 0, and #P-complete otherwise. 

The following lemma will be very useful in the proofs that follows. 

Lemma 3. Let (S', •) be a semigroup and I he the set of idempotents in S. A 
necessary condition for ffEQNg to he tractable is that the following holds: for 
all a in I and for allb,c in S, b • c = b ■ a- c. Otherwise ffEQNg is #P-complete. 



Proof. Assume that there exists a in / and that there exists 6, c in S and b - 
b ■ a • c. Consider the relation R = {(a;, y)\x ■ y = b • a • c} . R is closed under no 
Mal’tsev operation. 



{b,a- c) G R b ■ a ■ c = b • a • c 
{b • a,a ■ c) G R b-a-a-c=b-a-c 
{b ■ a,c) G R b ■ a ■ c = b ■ a ■ c 

M 

(6, c) ^ R b ■ c b ■ a ■ c 

Hence ffEQNg is #P-complete. □ 

We continue by proving a dichotomy in the case where the semigroup is a band. 

Theorem 10. If (S, •) is a band, then ffEQNg is tractable if (S, •) is a rectan- 
gular hand and #P-complete otherwise. 

Proof. It follows directly from Lemma 3 that if S is a band that is not a rect- 
angular band then ffEQNg is #P-complete. If (S, •) is a rectangular band we 
know from Theorem 5 that there exist a left zero semigroup (A, •/) and a right 
zero semigroup {R, >) such that (S, •) = (A, •;) x (i?, >). Thus the tractability of 
ffEQNg when (S, •) is a rectangular band follows from Theorem 7. □ 

The following lemma gives us an important necessary condition for the tractabil- 
ity of #EQN^. 

Lemma 4. Given a semigroup (5, •), it is a necessary condition for ffEQNg 
to he tractable that {Div{S),-) form a regular semigroup. Otherwise ffEQNg is 
#P-complete. 
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Proof. Arbitrarily choose a G Div{S) and assume that a = b ■ c for some b, c. 
Since S is finite, we know from Theorem 3 that 6” and c™ are idempotent for 
some m, n. From Lemma 3 we know that a = b-c = b-c^ ■b'^ -c = -6"“^ - a, 

otherwise ffEQNg is #P-complete. Thus it is a necessary condition for ffEQNg 
to be tractable that all elements in Div{S) are regular. □ 

Now we can prove a dichotomy ffEQNg when (S', •) is a regular semigroup. 

Theorem 11. Let (S, •) be a finite regular semigroup. Then ffEQNg is tractable 
if {S, •) is a direct product of an Abelian group and a rectangular band. Otherwise 
ffEQNg is #P-complete. 

Proof. If the idempotents in (S, •) do not form a rectangular band then by 
Lemma 3, ffEQNg is #P-complete. If the idempotents in (S, •) do form a rect- 
angular band then by Theorem 6, (S, •) is a direct product of a group and a 
rectangular band. We conclude from Theorem 7 that ffEQNg is tractable if 
(S, •) is a direct product of an Abelian group and a rectangular band. 

Now assume that (S, •) is isomorphic to the direct product of a non- Abelian 
group by a rectangular band i.e. (S, •) = (G, -g) x {RB, •{,) where (G, -g) is a 
non-Abelian group and {RB, ■{,) is a rectangular band. Consider the following 
relation R = {{x,y)\x ■ y = y ■ x}. We show that R is closed under no Mal’tsev 
operation. The elements below are chosen as follows, 1 is the identity in (G, -g), 
since (G, -g) is non-Abelian we know that there exists elements a, 6 in G such 
that a -g b ^ b -g a, and c is an arbitrary element in {RB, -b). 

{{a, c), (1, c)) e R {a, c) • (1, c) = (1, c) • (a, c) 

((l,c),(l,c)) G R (l,c) • (l,c) = (l,c) • (l,c) 

((1, c), (6, c)) G R (1, c) • {b, c) = {b, c) • (1, c) 

M 

{{a, c), {b, c)) i R {a, c) • {b, c) yf {b, c) • (a, c) 

Hence ffEQNg is #P-complete when {S, •) is isomorphic to the direct product 
of a non-Abelian group by a rectangular band. □ 

The following corollary follows directly from Lemma 4 and Theorem 11. 

Corollary 1. If {S, •) is a finite semigroup where all elements have divisors, 
then ffEQNg is tractable if {S, •) is a direct product of an Abelian group and a 
rectangular band, and #P-complete otherwise. 

The following theorem proves a dichotomy for ffEQNg when {S, •) is a semigroup 
with zero. 

Theorem 12. Let {S, •) be a finite semigroup with zero, then ffEQNg is 
tractable if for all x,y in S , x • y = 0, and #P-complete otherwise. 

Proof. The tractability part is proved in Theorem 7. We continue by proving 
the #P-completeness of ffEQNg when {S, •) is a semigroup with zero and there 
exist two elements a, 6 in S' such that a ■ b ^ 0. Consider the following relation 
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R = {{x,y)\x • y = 0}. It is easy to see that R is closed under no Mal’tsev 
operation. 

(a, 0) G i? a • 0 = 0 

(0,0) Gi? 0-0 = 0 

(0,6) Gi? 0-6 = 0 

M 

(a,b) ^ R a ■ b ^ 0 

Hence ^EQNg is #P-complete when (S', -) is a semigroup with zero and there 
exist two elements x,y in S such that x • y ^ 0. □ 

It is easy to realize that all the conditions we impose on our semigroups to ensure 
the tractability (or #P-completeness) of ^EQNg can be checked in polynomial 
time (in the size of the semigroups), except maybe for the condition in Theorem 
11. To check whether a semigroup, isomorphic to a direct product of a group 
and a rectangular band, is in fact isomorphic to a direct product of an Abelian 
group and a rectangular band, we need a way to extract the group part from the 
semigroup. Choose an idempotent x in (S, -). Then we know that x is of the form 
(1, a;') where 1 is the identity in (G, -g). Consider the set xSx, elements in this set 
is of the form (1, x') ■ (a, y) ■ (1, x') = (a, x'). This implies that {xSx, -) = (G, -g). 
Now it is easy to check whether (G, -g) is Abelian. 
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Abstract. The compaction and retraction problems are special graph 
colouring problems, and can also be viewed as partition problems with 
certain properties. A very close relationship between the compaction, re- 
traction, and constraint satisfaction problems has been established ear- 
lier providing evidence that it is likely to be difficult to give a complete 
computational complexity classihcation of the compaction and retrac- 
tion problems for reflexive or bipartite graphs. In this paper, we give a 
complete computational complexity classification of the compaction and 
retraction problems for all graphs (including partially reflexive graphs) 
with four or fewer vertices. The complexity classihcation of both the 
compaction and retraction problems is found to be the same for each of 
these graphs. This relates to a long-standing open problem concerning 
the equivalence of the compaction and retraction problems. The study 
of the compaction and retraction problems for graphs with at most four 
vertices has a special interest as it covers a popular open problem in 
relation to the general open problem. We also give complexity results for 
some general graphs. 



1 Introduction 

We first introduce the following definitions and problems, and then describe the 
motivation and results. 

1.1 Definitions 

An edge with the same endpoints in a graph is called a loop. A vertex u of a graph 
is said to have a loop if vv is an edge of the graph. A reflexive graph is a graph in 
which every vertex has a loop. An irreflexive graph is a graph which has no loops. 
Any graph, in general, is a partially reflexive graph, meaning that its individual 
vertices may or may not have loops. Thus reflexive and irreflexive graphs are 
special partially reflexive graphs. A bipartite graph is irreflexive by definition. 
We denote an irreflexive complete graph with k vertices by Kk- For a graph 
G, we use V{G) and E{G) to denote its vertex set and edge set respectively. 
Given an induced subgraph H of a, graph G, we denote hy G — H, the subgraph 
obtained by deleting from G the vertices of H together with the edges incident 
with them; thus G — H is a subgraph of G induced by V{G) — V{H). In the 
following, let G and H be graphs. 
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A homomorphism f : G ^ H, of G to H, is a, mapping / of the vertices 
of G to the vertices of H, such that f{g) and f{g') are adjacent vertices of H 
whenever g and g' are adjacent vertices of G. If there exists a homomorphism of 
G to H then G is said to be homomorphic to H . Note that for any homomorphism 
/ : G ^ i/, if a vertex v of G has a loop then the vertex f{v) of H necessarily 
also has a loop. If G is irreflexive then clearly G is /c-colourable if and only if 
G is homomorphic to Kk- Thus the concept of a homomorphism generalises the 
concept of a fc-colourability. 

A compaction c : G ^ H , of G to H , is a homomorphism of G to i?, such 
that for every vertex x of H, there exists a vertex v of G with c(v) = x, and 
for every edge hh' of H, h ^ h', there exists an edge gg' of G with c{g) = h 
and c(g') = h' . Notice that the first part of the definition for compaction ( the 
requirement for every vertex x of H) is relevant only if H has isolated vertices. 
If there exists a compaction of G to i? then G is said to compact to H. Given 
a compaction c : G ^ H , if for a vertex v of G, we have c{v) = x, where a; is a 
vertex of H , then we say that the vertex v of G covers the vertex x of H under 
c; and if for an edge gg' of G, we have c{{g, g'}) = {h, h'}, where hh' is an edge 
of H, then we say that the edge gg' of G covers the edge hh' of H under c (note 
in the definition of compaction, it is not necessary that a loop of H be covered 
by any edge of G under c). 

We note that the notion of a homomorphic image used in [Harary, 1969] (also 
cf. [Hell and Miller, 1979]) coincides with the notion of a compaction in case of 
irreflexive graphs (i.e., when G and H are irreflexive in the above definition for 
compaction) . 

A retraction r : G ^ H, of G to H, with H as an induced subgraph of G, 
is a homomorphism of G to i7, such that r{h) = h, for every vertex h of H . If 
there exists a retraction of G to then G is said to retract to H. Note that 
every retraction r : G — > is necessarily also a compaction but not vice versa. 

1.2 Homomorphism, Compaction, and Retraction Problems 

The problem of deciding the existence of a homomorphism to a fixed graph H , 
called the H-colouring problem, and denoted as H-COL, asks whether or not an 
input graph G is homomorphic to i7. If is a graph with a loop then every 
graph is trivially homomorphic to H . Also, if is a bipartite graph then we 
note that a graph G is homomorphic to H if and only if G is also bipartite and 
H has an edge if G has an edge. Thus the problem H-COL is interesting only 
if H is an irreflexive non-bipartite graph. A complete complexity classification 
of H-COL is given in [Hell and Nesetril, 1990]. It is shown there that H-COL 
is NP-complete for any irreflexive non-bipartite graph H . As pointed above, H- 
COL is polynomial time solvable otherwise. Note that the classic fc-colourability 
problem is a special case of the problem H-COL when H is Kk and the input 
graph G is irreflexive. 

The problem of deciding the existence of a compaction to a fixed graph H, 
called the compaction problem for H, and denoted as COMP-H, asks whether 
or not an input graph G compacts to H . 
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The problem COMP-H, in general, where H is a, fixed partially refiexive 
graph, can be viewed as the problem to decide whether or not it is possible to 
partition the vertices of a graph into certain fixed number of distinct non-empty 
sets such that there is at least one edge between some pair of distinct sets, and 
there is no edge between all other pair of distinct sets, and certain sets may be 
required to be independent (an independent set has no edge), where the sets 
and edges correspond to the vertices and edges of H , and an independent set in 
particular correspond to a vertex without a loop in H . 

When both G and H are input graphs (i.e., H is not fixed), and H is refiexive, 
the problem of deciding whether or not G compacts to H has been studied in 
[Karabeg and Karabeg, 1991, 1993]. Some related work has recently been studied 
in [Feder, Hell, Klein, and Motwani, 1999, 2003]. Note that unlike the problem 
H-COL, the problem COMP-H is still interesting if H has a loop or P[ is bipartite. 

The problem of deciding the existence of a retraction to a fixed graph H , 
called the retraction problem for H, and denoted as RET-H, asks whether or 
not an input graph G, containing H as an induced subgraph, retracts to H. 
Retraction problems have been of continuing interest in graph theory for a long 
time and have been studied in various literature including [Hell, 1972, 1974], 
[Nowakowski and Rival, 1979], [Pesch and Poguntke, 1985], [Bandelt, Dahlmann, 
and Schutte, 1987], [Hell and Rival, 1987], [Pesch, 1988], [Feder and Winkler, 
1988], [Bandelt, Farber, and Hell, 1993], [Feder and Hell, 1998], [Feder, Hell, 
and Huang, 1999]. The problem RET-H can again be viewed as the partition 
problem, as described above, with the restriction that each vertex of 77 is in a 
distinct set of the partition. 

Note that the graph H for the problems H-COL, COMP-H, and RET-H is 
assumed to be fixed by default even if not explicitly mentioned. 

1.3 Motivation and Results 

It is not difficult to show that for every fixed graph H, if RET-H is solvable in 
polynomial time then COMP-H is also solvable in polynomial time (a polyno- 
mial transformation from COMP-H to RET-H under Turing reduction is shown 
in [Vikas, 2004b]). Is the converse true? It was also asked, in the context of re- 
fiexive graphs, by Peter Winkler in 1988 (personal communication, cf. [Feder and 
Winkler, 1988]). Thus the question is whether RET-H and COMP-H are polyno- 
mially equivalent for every fixed graph 77. The answer to this is not known even 
when 77 is refiexive or bipartite. However, it is shown in [Vikas, 2004b] that for 
every fixed refiexive (bipartite) graph 77 there exists a fixed refiexive (bipartite) 
graph 77' such that RET-H and COMP-H' are polynomially equivalent. 

Using the above result of [Vikas, 2004b], and results of [Feder and Hell, 1998] 
and [Feder and Vardi, 1998], it is established in [Vikas, 2004b] that for every 
constraint satisfaction problem 77 (with fixed templates), there exists a fixed 
refiexive (bipartite) graph 77 such that the constraint satisfaction problem 77 
and the compaction problem COMP-H are polynomially equivalent. Since it is 
thought to be likely difficult to determine whether every constraint satisfaction 
problem (with fixed templates) is polynomial time solvable or NP-complete, thus 
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evidence is provided in [Vikas, 2004b] that it is likely to be difficult to determine 
whether for every fixed reflexive (bipartite) graph H, the problem COMP-H is 
polynomial time solvable or NP-complete. Similar evidence has been shown for 
RET-H in [Feder and Hell, 1998] in the case of fixed reflexive graphs H, and in 
[Feder and Vardi, 1998] in the case of fixed bipartite graphs H. Issues related 
to the constraint satisfaction problem have also been considered in [Feder and 
Vardi, 1993, 1998]. 

We however give in this paper, a complete complexity classification of COMP- 
H and RET-H when H has four or fewer vertices, i.e., for every graph H with 
at most four vertices (including when H is partially reflexive), we determine 
whether COMP-H is polynomial time solvable or NP-complete, and whether 
RET-H is polynomial time solvable or NP-complete. We find that the com- 
plexity classification of COMP-H and RET-H do not differ for such graphs H. 
Studying the complexity classification of the compaction and retraction prob- 
lems for graphs with at most four vertices has an additional significance as it 
includes a widely publicised open problem posed by Peter Winkler in 1988 to 
determine the complexity of COMP-H when H is a reflexive 4-cycle. This has 
been shown to be NP-complete in [Vikas, 1999, 2003]. The problem was asked in 
relation to the general problem mentioned earlier concerning the equivalence of 
the compaction and retraction problems, as the unique smallest reflexive graph 
H for which RET-H is NP-complete turned out to be a reflexive 4-cycle. 

Thus our study in this paper is motivated by two issues. One issue is con- 
cerned with the complete complexity classification of the compaction and re- 
traction problems, and the other issue is concerned with the equivalence of the 
compaction and retraction problems. We present results in this paper resolving 
fully the two issues for graphs up to four vertices. We hope that the techniques 
and constructions developed in this paper and the papers [Vikas, 1999, 2003, 
2004a, 2004b] would be helpful in resolving the bigger problem whether the 
compaction and retraction problems are equivalent for all graphs. 

We have more results showing that for several graphs H, the problems RET- 
H and COMP-H are polynomially equivalent. We mention below a few classes 
of such graphs. We do not know of any graph H for which the complexity 
classification of RET-H and COMP-H differ. 

It is known that RET-H is NP-complete when H is a reflexive k-cycle, for all 
fc > 4, cf. [Feder and Hell, 1998], G. MacGillivray, 1988 (personal communica- 
tion), and for k = 4, [Feder and Winkler, 1988] also. It is shown in [Vikas, 1999, 
2003] that COMP-H is NP-complete when H is a reflexive fc-cycle, for all k > 4. 
In particular, as mentioned above, for fc = 4, this result in [Vikas, 1999, 2003] 
solves a widely publicised open problem posed by Peter Winkler in 1988. When 
H is a reflexive chordal graph (which includes a reflexive 3-cycle), the problem 
RET-H is polynomial time solvable [Feder and Hell, 1998], and hence COMP-H 
is also polynomial time solvable. 

It is also known that RET-H is NP-complete when H is an irreflexive even 
fc-cycle, for all even fc > 6, cf. [Feder, Hell, and Huang, 1999], G. MacGillivray, 
1988 (personal communication). It is shown in [Vikas, 1999, 2004a] that COMP- 
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H is NP-complete when H is an irreflexive even fc-cycle, for all even k > 6. This 
result in [Vikas, 1999, 2004a] also solves a problem that has been of interest, 
since a long time, to various people including Pavol Hell and Jaroslav Nesetril 
(personal communications) . When H is a chordal bipartite graph (which includes 
an irreflexive 4-cycle), the problem RET-H is polynomial time solvable [Bandelt, 
Dahlmann, and Schutte, 1987], and hence COMP-H is also polynomial time 
solvable. 

The case of irreflexive odd cycles is a special case of a more general re- 
sult whereby RET-H and COMP-H are polynomially equivalent for every non- 
bipartite irreflexive graph H. It follows from the complexity of H-COL [Hell and 
Nesetril, 1990] that RET-H and COMP-H are NP-complete for any non-bipartite 
irreflexive graph H; in particular if H is an irreflexive odd fc-cycle then RET-H 
and COMP-H are NP-complete, for all odd fc > 3. Thus we conclude that RET- 
H and COMP-H both are NP-complete when H is an irreflexive fc-cycle, for all 
k > 3, k ^ 4, and polynomial time solvable, for k = 4. 

Let H he & graph and let Vl denote its set of vertices which have loops. It 
is shown by Feder, Hell, and Huang (personal communication, cf. [Feder, Hell, 
and Huang, 1999]) that if H is connected but Vl is not then RET-H is NP- 
complete. It is also shown by them (cf. [Feder, Hell, and Huang, 1999]) that if 
Vl is connected and H is a tree then RET-H is polynomial time solvable. 

With regards to more general graphs (not necessarily with at most four ver- 
tices), we show in this paper that if is a path of length > 2, with loops on 
the first (origin) and the last (terminus) vertices only, then COMP-H is NP- 
complete. Using this result, we also show in this paper that COMP-H is NP- 
complete when H is a graph consisting of only several internally disjoint paths 
of the same length > 2, having loops only on the common origin and terminus 
vertices. Also, we show in this paper that if is a graph consisting of only a 
path of length > 2, and a vertex outside the path adjacent to every vertex of the 
path, with loops only on the origin and the terminus vertices of the path, then 
COMP-H is NP-complete. We do not include the proofs here for these general 
graphs. For these graphs H, it follows from the above result of Feder, Hell, and 
Huang that RET-H is also NP-complete. 

We also show in this paper, the relationship between compaction to connected 
and disconnected graphs, and the relationship between retraction to connected 
and disconnected graphs. These results establish that the complexity of COMP- 
H and RET-H is not affected whether the graph H is connected or disconnected. 
Hence it would be sufficient to consider only connected graphs H when deter- 
mining the complexity of COMP-H or RET-H in the next section. We do not 
include the proofs here for these results. 

In the figures in this paper, we shall not be depicting any edge vh of G, 
with V G V{G — H) and h G V{H), where G is any graph containing H as 
an induced subgraph, i.e., G is an instance of RET-H. In Section 2, we give a 
complete complexity classification of COMP-H as well as RET-H when H has 
four or fewer vertices. 
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2 A Complete Complexity Classification 

of Compaction and Retraction to All Graphs 
with Four or Fewer Vertices 

In this section, we give a complete complexity classification of COMP-H and 
RET-H when H has four or fewer vertices. We shall see that the complexity 
classification of COMP-H and RET-H do not differ for such graphs H. When 
considering graphs H with four or fewer vertices, we will not be separately 
considering graphs H that fall in the categories discussed in Section 1.3, or are 
trivial (we do not explicitly point out all such cases here). Also, due to our results 
on compaction and retraction to connected and disconnected graphs, we will not 
be separately considering graphs H that are disconnected. For graphs H with 
four or fewer vertices, this leaves us to consider the graphs H in Figure 1. We 
consider each of the graphs H in Figure 1, and present the complexity results 
for COMP-H and RET-H 

For the graphs H in Figures 1(a), (6), (c), (d), (e), (/), (j), (fc), (m), and (p), 
the set of vertices of H with loops is disconnected (and H is connected), and 
hence it follows that RET-H is NP-complete, cf. [Feder, Hell, and Huang, 1999]; 
we however need to determine the complexity of COMP-H. We will be including 
here details for only some of the graphs. 

Theorem 2.1 COMP-H is NP-complete for the graph H in Figure 1(a). 

Proof. Let H be the path hohih 2 in Figure 1(a) with loops on ho and /12 only. It 
is clear that COMP-H is in NP. We prove NP-completeness of COMP-Hhy giving 
a polynomial transformation from RET-H to COMP-H. As mentioned above, the 
problem RET-H is NP-complete, cf. [Feder, Hell, and Huang, 1999]. Let G be a 
graph containing H as an induced subgraph, i.e., let G be an instance of RET-H. 
We construct in time polynomial in the size of G, a graph G' (containing G as 
an induced subgraph) such that the following statements (i), (ii), and (iii) are 
equivalent : 

(i) G retracts to H. 

(ii) G' retracts to H. 

(iii) G' compacts to H. 

We prove that (i) is equivalent to (ii), and (ii) is equivalent to (iii), in Lemma 2.1.1 
and Lemma 2.1.2 respectively. Since RET-H is NP-complete, this shows that 
COMP-H is NP-complete. 

The construction of G' is as follows. For each vertex v in V{G — H), we add 
to G three distinct new vertices : Uy adjacent to v, ho; Wy adjacent to v, Uy] and 
yy adjacent to Uy, Wy, ft- 2 - Thus Uy, Wy, and form a triangle, and Uy, Wy, and v 
form a triangle. See Figure 2. Note that there could be edges in G from v to some 
vertices of H but as mentioned earlier, in Figure 2 and all subsequent figures in 
this paper, we are not depicting these edges. This completes the construction of 
G'. 

We now prove the following two lemmas in order to prove the theorem. 
Lemma 2.1.1 G retracts to H if and only if G' retracts to H. 
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Fig. 1. List of graphs H to be considered 



Proof. If G' retracts to H then it is clear that G retracts to H since G is a 
subgraph of G' . Now suppose that r : G — > i? is a retraction. We define a 
retraction r' : G' ^ H as follows. 

For each vertex v of the graph G, we define 
r'(v) = r{v). 

For the vertices Uv,Wv, and of G', with v GV{G — H), we define 
r'{uv) = /iQ, r'{wv) = h(j, and r'{yy) = hi, if r{v) = Hq or hi, and 
r'{uy) = hi, r'(wy) = /i2, and r'(yy) = /i2, if r(v) = /12. 
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V 



Fig. 2. Construction of G' for a vertex v in G — H 



It is not difficult to see that for every edge ab of G', r'{a)r'{b) is an edge of 
H, i.e., r' : G' — > i/ is a homomorphism. Recall that the edges of G' are : gg', 
vuv, vwv, hoUy, h 2 yv, UyWy, Uyijy, and iVyUv, with gg' G E(G), v € V{G — H). 
Since r'{h) = r{h) = h, for all h G V{H), the homomorphism r' : G' — > i/ is a 
retraction, and the lemma is proved. □ 

Lemma 2.1.2 G' retracts to H if and only if G' compacts to H. 

Proof. If G' retracts to H then by definition G' compacts to H . Now suppose 
that c : G' — > 71 is a compaction. We first prove that c(/iq) yf c(/i 2 )- Suppose that 
c(lo) = c{h 2 ). Since Iq and /12 both have loops, it must be that c(/iq) = c(/i 2 ) = 
lo or c(lo) = c(/i 2 ) = h. 2 . Without loss of generality, let c(lo) = c{h 2 ) = ho. We 
can assume this, as due to symmetry of vertices in 77, we can always redefine the 
compaction c so that c{ho) = c(/i 2 ) = ho. Since Uy, yy, and hi are adjacent to ho 
or /i 2 ) c{uy) yf /i 2 , c{yy) yf h 2 , and c(/ii) yf h 2 , with v gV(G — 77). This implies 
that if c(wy) = h 2 , for some vertex v G V{G — 77), then c(uy) = c(y„) = 7i, as 
Uy and yy are adjacent to Wy. But this is impossible as Uy is adjacent to yy, and 
hi does not have a loop. Hence c{wy) yf 72, for any v G V{G — 77). Similarly, if 
c{v) = 72, for some vertex v G V{G — 77), then c{uy) = c{wy) = hi, and hi does 
not have a loop. Hence c{v) yf 72, for any v G V{G — 77). 

Thus, we have shown that c(a) yf 72, for all a G V(G'), contradicting our 
assumption that c : G' — > 77 is a compaction. Hence, it must be that c(7q) yf 
0 ( 72 ). Since 7q and 72 are the only vertices in 77 with a loop, c({7q,72}) = 
{7o,72}. Without loss of generality, let c(7q) = 7q and c ( h2 ) = 72 (due to 
symmetry). Since hi is adjacent to 7 q and 72, we have c(7i) = hi. Thus c : 
G' — > 77 is a retraction, and the lemma is proved. □ 

This completes the proof of Theorem 2.1. □ 

Theorem 2.2 COMP-H is NP-complete for the graph 77 in Figure 1(g). 
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Proof. Clearly, COMP-H is in NP. Let S be the irreflexive triangle hohih2ho. 
We give a polynomial transformation from RET-S to COMP-Pl. As mentioned 
earlier, it follows from [Hell and Nesetril, 1990] that RET-S is NP-complete. Let 
a graph G containing S as an induced subgraph be an instance of RET-S. We 
construct in time polynomial in the size of G, a graph G' (containing G and 
H as induced subgraphs) such that the following statements (i), (ii), and (iii) 
are equivalent : (i) G retracts to S, (ii) G' retracts to H, (iii) G' compacts to 
p[. We prove that (i) is equivalent to (ii), and (ii) is equivalent to (iii), in two 
separate lemmas, which we do not include here. Since RET-S is NP-complete, 
this shows that COMP-H is NP-complete. We note that the equivalence of (i) 
and (ii) shows that RET-H is also NP-complete. The fact that RET-H is NP- 
complete, we have also shown using a simpler construction which we do not 
include here. The complexity of RET-H was also unknown before. 

The construction of G' is as follows. For each vertex u of G — S', we add to G 
six distinct new vertices : adjacent to v and /iq; Wy adjacent to v, hi, and 

Xy adjacent to v and hg; yy adjacent to Xy, Wy, and ho; Sy adjacent to v and hi; 
and Zy adjacent to s„, Uy, and hi. We also add to G a new vertex /13 adjacent to 
h.2 and itself only (thus H is an induced subgraph of the resulting graph). See 
Figure 3. This completes the construction of G'. □ 




Fig. 3. Construction of G' for a vertex v in G — H 



Theorem 2.3 COMP-H and RET-H are polynomial time solvable for the graph 
H in Figure l(q). 

Proof. Let the graph H in Figure 1(5) be hohi/i2h3ho with loops on all the ver- 
tices except /13. We give a polynomial time algorithm for RET-H. Since COMP-H 
polynomially transforms to RET-H (under Turing reduction), this shows that 
COMP-H is also polynomial time solvable. Let a graph G containing H as an 
induced subgraph be an instance of RET-H. Let S' be a subgraph of G induced 
by Nhr{h^) (which includes /iq and /12). The algorithm for RET-H is described 
below. 
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Fig. 4. List of All Graphs H with at most Four Vertices for which COMP-H and 
RET-H are NP-complete 



1 . If there exists a path from ho to h.2 in S then G does not retract to H. 

2 . Else G retracts to H with a retraction r : G ^ H defined as follows. We have 
that ho and h2 are disconnected in S. Let ^ be a component of S containing h,2. 
Thus ho ^ Z. We define 

r(h) = h, for all h G V{H), 
r{v) = ho, for all v G Nbr{h^) — V{Z), 
r(v) = h2, for all v G V(Z), and 
r(v) = hi, for all v G V(G) - (Nbrihs) U V{H)). 



□ 
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We consider the other graphs also, and show that for all the graphs H 
in Figure 1, COMP-H and RET-H are NP-complete or polynomial time solv- 
able, and that the complexity classification of COMP-H and RET-H do not dif- 
fer. For the graphs H in Figures 1(a), (6), (c), (d), (e), (/), (g), (j), (k), (m), and 
(p), both COMP-H and RET-H are NP-complete. For the graphs H in Fig- 
ures and (q), both COMP-H and RET-H are polynomial 

time solvable (it is not difficult to see that none of these graphs are absolute 
retracts which would imply polynomiality anyway; we do not discuss absolute 
retracts here). As discussed before, for the graphs H with at most four vertices 
not listed in Figure 1, either we know from the results mentioned in Section 1.3, 
or are trivial, or we can infer from our results on compaction and retraction to 
connected and disconnected graphs, that COMP-H and RET-H are NP-complete 
or polynomial time solvable, and notice again that the complexity classification 
of COMP-H and RET-H do not differ. 

A list of all graphs H with at most four vertices for which COMP-H and 
RET-H are NP-complete is given in Figure 4. For all other graphs H with at 
most four vertices, not in this list, both COMP-H and RET-H are polynomial 
time solvable. 
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Abstract. In a graph G, k disjoint paths joining a single source and k 
distinct sinks that cover all the vertices in the graph are called a one-to- 
many k-disjoint path cover of G. We consider a fc-disjoint path cover in a 
graph with faulty vertices and/or edges obtained by merging two graphs 
Ho and Hi, |U(_ffo)| = |P(^i)| = n, with n pairwise nonadjacent edges 
joining vertices in Ho and vertices in Hi. We present a sufficient condition 
for such a graph to have a fc-disjoint path cover and give the construction 
scheme. Applying our main result to interconnection graphs, we observe 
that when there are / or less faulty elements, all of recursive circulant 
G(2’",4), twisted cube TQm, and crossed cube CQm of degree m have 
fc-disjoint path covers for any / > 0 and fc > 2 such that / -|- fc < m — 1. 



1 Introduction 

Usually a measure of reliability (or fault-tolerance) of an interconnection network 
is given by the maximum number of nodes which can fail simultaneously without 
prohibiting the ability of each fault-free node to communicate with all other 
fault-free nodes. Connectivity of an interconnection graph corresponds to the 
reliability of the interconnection network which is subject to node failures. 

It is well-known that connectivity of a graph G was characterized in terms 
of disjoint paths joining a pair of vertices in G. According to Monger’s theorem, 
a graph G is fc-connected if and only if for every pair of vertices s and t, G has 
k disjoint paths (of type one-to-one) joining them. It is straightforward to verify 
that a graph G is fc-connected if and only if G has fc disjoint paths (of type 
one-to-many) joining every source s and fc distinct sinks ■ ■ ■ ,tk such that 

ti s for all 1 < i < fc. 

Sometimes “restricted” one-to-many disjoint paths joining a source s and fc 
distinct sinks ti,t 2 , ■ ■ ■ ,tk are required. When we analyze Rabin number [1] of a 
graph G, we are to find fc disjoint paths P\,P 2 , ■ ■ ■ ,Pk such that max{^i} is as 
small as possible, where Pi is an s-tt path of length k, 1 < i < k. A star-like 
tree [2] is a subdivision of a star, a tree with only one vertex which is not an 
endvertex. Given a source s and lengths ^i, I 2 , ■ ■ ■ ,lk of k paths, the star-like tree 
problem is essentially to find one-to-many disjoint paths joining s and some fc 
sinks whose lengths are ^i, I 2 , ■ ■ ■ , h, respectively. 

* This work was supported by grant No. R05-2003-000-11506-0 from the Basic Re- 
search Program of the Korea Science & Engineering Foundation. 
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Given a source s and a set of k sinks T = {ti, ^ 2 , • ■ • , tfc} in a graph G, we 
are concerned with one-to-many disjoint paths Pi, P 2 , . . . , Pk G joining s and 
T that cover all the vertices in the graph, that is, Ui<i<fc^(^*) = 

V{Pi) n V{Pj) = {s} for all i yf j. Here V{Pi) and V{G) denote the vertex sets 
of Pi and G, respectively. We call such k disjoint paths a one-to-many k-disjoint 
path cover (in short, one-to-many k-DPC) of G. 

Embedding of linear arrays and rings into a faulty interconnection graph 
is one of the central issues in parallel processing. The problem is modelled as 
finding as long fault-free paths and cycles as possible in the graph with some 
faulty vertices and/or edges. A graph G is called f -fault hamiltonian (resp. /- 
fault hamiltonian- connected) if there exists a hamiltonian cycle (resp. if each 
pair of vertices are joined by a hamiltonian path) in G\F for any set F of 
faulty elements such that |F| < /. For a graph G to be /-fault hamiltonian 
(resp. /-fault hamiltonian-connected), it is necessary that / < <5(G) — 2 (resp. 
/ < — 3), where 5{G) is the minimum degree of G. 

To a graph G with a set of faulty elements F , the definition of a one-to-many 
disjoint path cover can be extended. Given a source s and a set of k sinks T = 
{ti,t 2 , ■ . ■ , tfc} in G\F, a one-to-many fc-disjoint path cover joining s and T is k 
disjoint paths Pi joining s and ti, 1 < i < k, such that Ui<i<fc ^(^0 = 

V{Pi) n V{Pj) = {s} for all i yf j, and every edge on each path Pi is fault- 
free. Such a one-to-many fc-DPG is denoted by /c-DPG[s, {^ 1 ,^ 2 , • ■ • , tk}\G, E]. A 
graph G is called f -fault one-to-many k-disjoint path coverahle if for any set F 
of faulty elements such that \F\ < f, G has k-DPC[s,{ti,t 2 , ■ ■ ■ ,tk}\G, F] for 
every source s and k distinct sinks t\,t 2 , ■ ■ ■ ,tk in G\F. 

Proposition 1. Let G he a graph. The following statements are equivalent. 

(a) G is f -fault hamiltonian-connected. 

(b) G is f -fault one-to-many 1-disjoint path coverahle. 

(c) G is f -fault one-to-many 2-disjoint path coverahle. 

We are given two graphs Hq and Fix with n vertices. We denote by Vi 
and Ei the vertex set and edge set of Fli, i = 0,1, respectively. We let Vq = 
{vi,V 2 , . . . , Vn\ and V\ = {wi,W 2 , ■ ■ ■ , Wn}. With respect to a permutation M = 
{ii,i 2 , . . . ,in) of {1,2, ...,n}, we can “merge” the two graphs into a graph 
Ho 0M Hi with 2n vertices in such a way that the vertex set V = VoUVi 
and the edge set if = Eq U U E 2 , where E 2 = {(uj,Wi^)|l < j < n}. We 
denote by Hq 0 Eli an arbitrary graph obtained by merging PIq and ifi w.r.t. 
some permutation M. Here, Flo and Hi are called components of Ho 0 Hi. 

In this paper, we will show that we can always construct /-fault one-to-many 
k 0 1-DPG in Ho 0 Hi by using /-fault one-to-many fc-DPG of Hi and fault- 
hamiltonicity oi Hi, i = 0,1. Precisely speaking, we will prove the following. 
Theorem 1. For / > 0 and k >2, let Hi, z = 0, 1, he a graph with n vertices 
satisfying the following three conditions: 

(a) Hi is f -fault one-to-many k-disjoint path coverahle. 

(b) Hi is f -\- k — 2-fault hamiltonian-connected(2-disjoint path coverahle). 

(c) Hi is f -\- k — 1-fault hamiltonian. 

Then, Ho 0 Hi is f -fault one-to-many k 0 1-disjoint path coverahle. 
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By applying the above theorem to interconnection graphs, we will show that 
all of recursive circulant G(2™,4), twisted cube TQm, and crossed cube CQm 
of degree m are /-fault one-to-many /c-disjoint path coverable for every pair of 
/ > 0 and k >2 such that / -I- A: < to — 1 . 

2 Construction of One-to-Many DPC in Hq 0 Hi 

In this section, we will prove Theorem 1 by constructing a one-to-many fc -|- 1- 
DPC in Hq 0 Hi when the number of faulty elements is / or less. Since each Hi 
satisfies the condition (c) of Theorem 1, it is necessary that f + k— l<Si — 2, 
where Si = S{Hi). That is, 

f + k < 5 — 1, 

where 6 = min{i5o, lii}- We assume w.l.o.g. that the source s is in Hq, and there 
are k + 1 distinct sinks T = {t\,t 2 , ■ ■ ■ h sinks in Hi, i = 0, 1. Thus, 

kQ + k\ = k + 1. 

We assume that . . . ,tk^} is the set of sinks in Hi. We denote by F the 

set of faulty elements in Hq(BHi. We let /o, fi, and /2 be the numbers of faulty 
elements in Hq, in Hi, and between Hq and Hi, respectively. Let // and // be 
the numbers of faulty vertices and edges in Hi, respectively. Then, 

= /o + /i + /2 < /, and // 0 // = fi,i = 0, 1. 

Before going on, we need some definitions and notation. For a vertex v in 
Hq(BHi, we denote by v the vertex adjacent to v which is in a different component 
from V. A vertex v in Hq 0 Hi is called free if u yf s, u ^ T, and v ^ F. A free 
bridge of a vertex w is the path {w, w) of length one if w ^ T, w ^ F, and 
{w, w) ^ F; otherwise, it is a path {w, v, v) of length two such that v ^ w, v and 
V are free vertices, and (w,v), (v,v) ^ F. 

Lemma 1. For any source and sink w in Hq 0 Hi, there exists a free bridge 
ofw. 

Proof There are at least i5 0 1 candidates for a free bridge of w, and at most 
/ 0 A: 0 1 elements “block” the candidates. For the source w, at most / faulty 
elements and A:0 1 sinks form blocking elements. For a sink w, blocking elements 
are faulty elements, k sinks (excluding w itself), and the source. Since f + k < 
(5 — 1, the lemma is proved. □ 

We define L to be a set of edges joining a sink w in Hi and w in Hq different 
from s such that (w,w) is not the free bridge of w. That is, 

L = {{w, w)\w is a sink in Hi,w yf s, and w G T or w G F or {w, w) G 0}. 

An edge (w, w) in L is called matched and its endvertices are called matched 
vertices. We denote by //" be the number of matched faulty edges, faulty edges 
in L. We let I = \L\. Then, 

l<kQ + f^ + ft 
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If s G r and (s, s) G F, we let a = 1; otherwise, let a = 0. Obviously, 

I + a < ki, and + ol < 

First we will construct /-fault one-to-many A:-|- 1-DPC for the case of k\ > 1, 
and then we will consider the case of fci = 0 later in Section 2.3. Construction I 
gives an /-fault one-to-many k + 1-DPC unless the following conditions of Cl, 
C2, and C3 are satisfied: 

Cl: fci = 1, s G T, /o = /, 

C2: fci -l- /i > k + f , 

C3: I + cx > 2, I = ko + Jq + f 2 , f = fo + fi + f 2 + a. 

Construction II considers the case that satisfies C3. Remaining cases are con- 
sidered later in Section 2.1 and 2.2. We denote hy H[v,w\Hi, F] a hamiltonian 
path in Hi\F joining a pair of fault-free vertices v and w. We let Fi = F O Vi. 

Construction I 

UNDER the condition of > 1. 

UNLESS Cl, C2, and C3 

{ an arbitrary matched sink, if ? > 1; 
s, if ? = 0 and s G T; 

an arbitrary sink, it I = 0 and s ^ T. 

2. Find a free bridge of s and free bridges Bj = of tj for all j, 

2 < / < A:i. They should be pairwise disjoint. 

3. When (s, s) is the free bridge of s: 

- Find fc-DPC[s, {4, ■ • ■ ■ ■ ■ ,tk+i}\Ho, Fq], 

- Find H[s,ti\Hi,Fi U F'], where F' = ^il- 

~ Merge the /c-DPC and the hamiltonian path obtained in the previous 
step with the edge (s, s) and the free bridges Bj, 2 < j < ki. 

4. When (s, s) is not the free bridge of s: Let (s, x, y) be the free bridge of s. 

4.1 When (s, s) ^ F, s £ T, and ki > 2: Let z he ti if s yf ti; otherwise, let 

z be ^ 2 - If s yf U, we assume w.l.o.g. that s = t 2 - 
- Find /c-DPC[s, {<3, . . . , . . . , tk+i} U {x}\Ho, Eq]. 

~ Find H[y,z\Hi,Fi U F'], where F' = {s} U U 3 <y<fei ^i}- 
— Merge the fc-DPC and the hamiltonian path obtained with the edges 
(s, s), (x, y) and the free bridges Bj, 3 < / <k\. 

4.2 When (s, s) ^ F, s £ T, and ki = 1, or (s, s) G E, or s G E: In this 
case, it holds true that /o < /. (Recall Cl.) 

- Find A:-DPC[s, {t'a, • ■ • , ffei+i, ■ • • , tk+i}\Ho, Eq U {a;}]. 

- Find H[y,ti\Hi,Fi U E'], where E' = \J 2 <j<ki{'^i^j) ^il- 
~ Merge the fc-DPC and the hamiltonian path obtained with the edge 
(x,y) and the free bridges Bj, 2 < j < k\. 

When we find a fc-DPC or a hamiltonian path, sometimes we regard some 
fault-free vertices and/or edges as faulty elements. They are called virtual faults. 
For example, in step 4.2 of Construction I, the vertex a; is a virtual fault when 
we find a fc-DPC, and E' is a set of virtual faults when we find a hamiltonian 
path. 
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The existence of pairwise disjoint free bridges in step 2 of Construction I can 
be shown by extending Lemma 1. The proof is omitted in this paper. 

Lemma 2. We can always find k-DPC’s of Hq in step 3, 4-fi and 4-2 of Con- 
struction I. 

Proof. In each step, the number of sinks is k and the number of faulty elements 
including virtual faults is at most /. The fc-DPC’s can be found since Hq is 
/-fault one-to-many /c-disjoint path coverable. □ 

Lemma 3. We can always find hamiltonian paths of Hi in step 3, 4-1, and 4-2 
of Construction I unless C2 and C3. 

Proof. Let /' = |F'|. A sink tj in Hi contributes to f by 2 if tj is matched or 
if tj = s and (s, s) G F. For I > I, f' fi = {ki — 1) 2{l — 1) a fi = 

ki 1 ex f I — 2. When I ex ^ 2, fi fi ki {ko /q -I- ffi )-l“Cr-|-/i — 2< 
{k-\-l)-\-f — 2<f-\-k — 1. The equality holds only when I = ko fQ f 2 and 
f = fo + f 2 + 0 ( fi. Unless C3, /' -k /i < f + k — 2, and thus there exists a 
hamiltonian path between every pair of fault-free vertices in Hi. When l-\-a < 2, 
that is, 1 = 1 and a = 0, /' -k /i = fci -k 1 -k /i — 2. Unless C2, /'-k/i < / -kfc — 2. 
For Z = 0, /'-k/i < {ki — 1) -k/i- Also in this case, unless C2, /'-k/i < f -\-k — 2. 
This completes the proof. □ 

Let us consider the case that satisfies C3. Obviously, ki > 2. Observe that 
for every free vertex w{fi s) in Hi, (w,w) is a free bridge of w. 

Construction II 

UNDER the condition C3: Z -k a > 2, Z = fco + /o + /i") / = /o + /i + /i" + 

1. Let be a sink as defined in step 1 of Construction I. 

2. Find a free bridge of s and free bridges Bj = {tj,...,t'j) of tj for all j, 
2 < j <ki. They should be pairwise disjoint. 

3. When (s, s) is the free bridge of s (a = 0, Z > 2): We assume that ^2 is a 
matched sink. 

- Find H[s,ti\Hi,Fi\JF'], where F' = {(U, O)} U {U(Rj) C Ui}. 

Let the hamiltonian path {s, . . . ,t 2 , z, . . . , ti). Note that z is a free vertex. 
~ Find fc-DPC[s, {fi, ...,fi^, . . . , tk+i} U {z}\Hq, Fq]. 

— Merge the fc-DPC and the hamiltonian path with the edges (s, s), (z,z) 
and the free bridges Bj, 3 < j < ki. 

4. When (s, s) is not the free bridge of s: Let (s, x, y) be the free bridge of s. 

4.1 When (s, s) ^ F and s G T (a = 0, Z > 2, fci > 3): We assume that t 2 
is a matched sink and t^ is s. 

- Find H[y,ti\Hi,Fi UF'], where F' = {(ti, O)} U U 3 <j<fei C 
Vi}. Let the hamiltonian path {y,...,t 2 ,z,..., ti). 

- Find FDPC[s, {^ 4 , tfei-K, ■ • ■ , tk+i} U {a;, z}\Hq, Fq]. 

— Merge the fc-DPC and the hamiltonian path with the edges (s, s), 
(x,y), (z,z) and the free bridges Bj, 4 < j < ki. 
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4.2 When (s, s) G F or s G F: Let t 2 be a matched sink if s ^ T; otherwise, 
let ^2 be s. Note that /o < /• 

- Find H[y,ti\Hi,FiU F'], where F' = {(ti, ^2)} U U3<i<fei 

Vi}. Let the hamiltonian path ti). 

- Find /c-DPC[s, {<3 , . . . ■ ■ ■ ,tfe+i} U {z}|iLo,^b U {x}]. 

~ Merge the /c-DPC and the hamiltonian path with the edge {x,y), 
(z, z) and the free bridges Bj, 3 < j < ki. 

It is easy to observe the existence of fc-DPC’s of Hq in step 3, 4.1, and 4.2 of 
Construction II. 

Lemma 4. We can always find hamiltonian paths of Fli in step 3, 4,1, and 4-2 
of Construction II. 

Proof Let /' = |F'|. For a = 0 (^ > 2), f' + fi = {ki - 1) + 2{l - 2) + 1 + fi = 
ki + I + fi — 3 = ki + {ko + /o+/^) + /i— 3< {k + 1) + f — 3 = f + k — 2. 
For a = 1, f' + fi = {ki — ^ — 1) + 2{l — 1) + 1 + /i = ki + I + fi — 2 = 
ki + {ko + fo + /I") + /i — 2 = (A: + 1) + (/ — 1) — 2 = / + A: — 2. This completes 
the proof. □ 

Hereafter in this section, we will construct /-fault one-to-many k + 1-DPC’s 
for two exceptional cases of Cl and C2 and for the case of A;i = 0. 



2.1 When C2: fci + /i > fc + / 

The condition C2 is equivalent to that ko + {fo + f 2 ) < 1- When (s, s) is not the 
free bridge of s, we denote by (s, x, y) the free bridge of s. Let v be the endvertex 
of the free bridge of s in ili. Of course, u = s if (s, s) is the free bridge of s; 
otherwise, v = y. We let 

_ f 0, if (s, s) is the free bridge of s; 

[ {a:}, if at least one of (s, s) and s are faulty. 



Case 1: /o + /2 = 0 and ko = 0. 

We first consider the case that s is a free vertex or a faulty vertex. We find 
H[v,ti\Hi,Fi] and let the path be {v , . . . , tfc+i, Zfc-i-i, ...,tk,Zk,.. ■ ,^2,22, • • ■ ,Ai)- 
We find A:-DPC[s, {z2 , . . . , Zfc+i}|iLo, F’'], and then merge the A;-DPC and the 
hamiltonian path with the free bridge of s and the edges {zj , Zj ) for all 2 < 
j < k + 1. The existence of a hamiltonian path in iFi and a A:-DPC in Ho is 
straightforward . 

For the remaining case that s is a sink, we let ti = s. Find a hamiltonian cycle 
C = {t 2 , ...,z,t 3 ,...,u)m Hi\{FiUF"), where F" = {AijUlO, ■ • • ,Afc+i}- Tbe 
C exists since Hi is f + k — 1-fault hamiltonian and fi + \F”\ = fi + {k — 1) < 
f + k—1. We let Qi = {t 2 , ■ ■ ■ ,z) and Q2 = (A3 , . . . , rt) be two disjoint paths such 
that C = {Qi,Q 2 )- Find A;-DPC[s, {z, m, O, . . . , tk+i}\Ho, 0], and then merge the 
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Case 2: /o + /2 = 0 and ko = 1. 

Let tk+i be the sink in Hq. We consider the case that s is either a free vertex or a 
faulty vertex first. We find /c-DPC[w, , tk}\Hi,Fi], We let the v-tj path 

in the /c-DPC be {v, Zj, . . . , tj) for 1 < j < fc, and assume w.l.o.g. that Zj yf tk+i 
for every 2 < j < k. We find fc-DPC[s, {z 2 , . . . ,Zk, tfe+i}|iLo, F'], and then merge 
the two /c-DPC’s with the free bridge of s and {zj, Zj) for all 2 < j < fc. A A: + 1- 
DPC of Hq 0 Hi can be obtained by removing all the edges (v,Zj), 2 < j < k. 
It is easy to observe the existence of two Ac-DPC’s. 

When s € T, we let ti = s and assume w.l.o.g. that tj yf tk+i for all 
3 < j < k. Find a hamiltonian cycle C = {t 2 , z, . . . ,u) in Hi\{Fi U F"), 
where F" = {ti} U {ts, . . . ,tk}- We assume w.l.o.g. that z yf tfe+i. Find k- 
DPC[s, {z, ts, . . . , tfe, tfe+i}|iLo, 0], and then merge the fc-DPC and C with the 
edges (s, s), (z, z), and for all 3 < j < k. 

Case 3: /o 0 /2 = 1 and ko = 0. 

We assume w.l.o.g. that (ti,ti) is the free bridge of ti. Let us first consider the 
case that either (s, s) is a free bridge of s or at least one of (s, s) and s are faulty 
elements. We find fc-DPC[u, {t 2 , ■ ■ ■ ,tk+i}\Hi, FiU{ti}]. The fc-DPC exists since 
/i 0 1 = /i 0 (/o 0 / 2 ) < /• We let the v-tj path in the fc-DPC be {v, Zj,. . . , tj), 
2 < j < k 1. We assume w.l.o.g. that (zj,Zj) is the free bridge of Zj for all 

2 < j < k. Find fc-DPC[s, {ti, Z 2 , . . . , Zk}\Ho, Fq U F']. The existence of the fc- 

DPC is due to the fact that |F'| = 1 if and only if /i 0 /2 > 1 (equivalently, 
/o < / — 1). Finally, we merge the two fc-DPC ’s with the free bridge of s and 
the edges (ti,ti), and {zj,Zj) for all 2 < j < k. 

Now, (s, s) is a fault-free edge and s is a sink. We let ti = s. Find fc- 
DPC[y,{t 2 , . . . ,tk+i}\Hi, Fi U {ti}], and let the y-tj path in the fc-DPC be 
{y, Zj, . . . ,tj), 2 < j < fc 0 1. We assume w.l.o.g. that (zj,Zj) is the free bridge 
of Zj for every 2 < j < k. We find fc-DPC[s, {z 2 , . . . , zfc} U {x}\Ho, To]- Finally, 
we merge the two fc-DPC’s in a very similar way to the previous case. 

2.2 When Cl: fei = 1, s 0 T, /o = / 

We let s = ti- Find fc-DPC[s, {t 2 , ■ ■ ■ , tk+i}\Ho, Fq]- Since / 0 fc < <5 — 1, there 

exists j, 2 < j < fc 0 1, such that the s-tj path Pj in the fc-DPC passes 

through at least two vertices z and u adjacent to s satisfying (s,z), (s,u) ^ F. 
Let Pj = (Qi,Q 2 ), where Qi = {s, z, . . . ,x), Q 2 = {u,...,tj). That is, Pj = 
{s, z, . . . ,x,u, . . . ,tj). We redefine Pj to be Pj = (s,Q 2 ) = {s,u, . . . ,tj), and 
construct s-ti path in a way that P\ = {Qi,x, H[x,ti\Hi,^). All the s-tj paths 
form a fc 0 1-DPC in Hq © Hi. 

2.3 When = 0 

If (s, s) is not the free bridge of s, we let (s, x, y) be the free bridge of s. First, 
let us consider the case that either /i 0 /2 = 0 or /i 0 /2 = 1 and (s, s) is not 
the free bridge of s. Obviously, when fi-\-f 2 = l and (s, s) is not the free bridge 
of s, s € F or (s, s) 0 F. Thus, in this case, for every vertex v in Hq different 
from s, (v,v) is fault-free. We let 
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F' = if /i + /2 — 0; 

[ {a:}, if /i + /2 = 1 and (s, s) is not the free bridge of s. 

We first find fc-DPC[s, FqU F']. Here, sink tk+i is regarded as 

a free vertex. The /c-DPC exists since fo + \F'\ < f ■ Some s-tj path Pj in the 
fc-DPC passes through tk+i, and let Pj = {s,Qi,tk+i, z,Q 2 ,tj). Define s-tk+i 
path to be (s, Qi,tk+i)- We denote by Bg the free bridge of s, and let v be the 
endvertex of Bg in i?i. Redefine Pj to be Pj = {Bg,H[v, z\Hi,Fi],z, Q 2 , tj). All 
of the s-tj paths form a k 1-DPC in Hg 0 Hi. 

Now, we consider the case that either /i 0 /2 > 2 or /i 0 /2 = 1 and (s, s) is 
the free bridge of s. If there is a sink tj such that {tj,tj) is the free bridge of tj, 
then we let ti be such a sink. If there is no such a sink, we let ti be an arbitrary 

sink and let (ti, z, z) be the free bridge of t\. We let F' = F^ LI F[^, where 

_ f 0, if (s, s) is the free bridge of s; 

® [ {a;}, if (s, s) is not the free bridge of s, and 

p! \ {^i}) if (ti,ti) is the free bridge of 0; 

{ti,z}, if (ti,ti) is not the free bridge of 0. 

We find /c-DPC[s, {^ 2 , • ■ • , tfc+i}|^^ 0 ) and then find H[v, w\Hi,Fi], where 

w is the endvertex of the free bridge of ti in H\. The existence of such a 

fc-DPC is due to the following lemma. The fc-DPC and an s-ti path Pi = 

{s,Bg, H[v,w\Hi, Fi], Bi,ti) form a fc 0 1-DPC, where Bi is the free bridge 
of ti- 

Lemma 5. If either /i 0 /2 > 2 or /i 0 /2 = 1 and (s, s) is a free bridge of s, 
then /o 0 /' < f, where f = \F'\. 

Proof Observe that if (s, s) is not the free bridge of s, then at least one of s and 
(s, s) are faulty elements. Similarly, if (tj,tj) is not the free bridge of tj, then 
tj G 0 or {tj,tj) G F. When (ti, ti) is not the free bridge of ti, /i 0 /2 > fc 0 1, 
and thus fo < f — k — 1 and fo + f < fo + i < (/ — fc — 1) 0 3 < /. Recall that 
fc > 2. Now, let us consider the case that (ti, ti) is the free bridge of ti. If (s, s) 

is the free bridge of s, /o 0 /' = /o 0 1 < / since /i 0 /2 > 1- If (s, s) is not the 

free bridge of s, /i 0 /2 > 2 by assumption, and thus /o 0 /^ = /o 0 2 < /. □ 

3 Application 

A graph G is called fully one-to-many disjoint path coverable if for any / > 0 
and fc > 2 such that / 0fc<<5(G) — 1, Gis /-fault one-to-many fc-disjoint path 
coverable. A graph G is called almost fully one-to-many disjoint path coverable 
if for any / > 0 and fc > 3 such that f k < (5(G) — 1, Gis /-fault one-to- 
many fc-disjoint path coverable. Note that almost fully one-to-many disjoint path 
coverable graph G which is (5(G) — 3-fault hamiltonian-connected is fully one-to- 
many disjoint path coverable. The following is immediate from Theorem 1. 

Corollary 1. Let Hi be a graph with n vertices satisfying the following two 
conditions, where St = 6{Hi), z = 0, 1. 
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(a) Hi is fully one-to-many disjoint path coverable. 

(b) Hi is Si — 2-fault hamiltonian. 

Then, Hq 0 Hi is almost fully one-to-many disjoint path coverable. 

3.1 Recursive Circulants G(2"^,4) 

G(2"‘,4) is an m-regular graph with 2™ vertices. According to the recursive 
structure of recursive circulants [6], we can observe that G(2™,4) is isomorphic 
to G(2'"“^,4) X K 2 ©M G(2™“^,4) x K 2 for some permutation M. Obviously, 
G( 2 ™- 2 ^ 4 ) X is isomorphic to G(2’”“^,4) ©m' G(2’”“^,4) for some M'. First 
we consider fault-hamiltonicity of G(2'", 4) and G(2’”, 4) x K 2 , and then we show 
that G(2™,4), m > 3, is fully one-to-many disjoint path coverable. Lemma 7 
implies that G(2"‘“^,4) x K 2 of degree m, m > 4, is m — 3-fault hamiltonian- 
connected and m — 2-fault hamiltonian. The proof of Lemma 7 is omitted in this 
paper due to space limit. 

Lemma 6. [7] G(2’”, 4), m > 3, is m— 3-fault hamiltonian- connected and m—2- 
fault hamiltonian. 

Lemma 7. If a graph G is 6{G) — 3-fault hamiltonian- connected and S(G) — 2- 
fault hamiltonian for 6{G) > 3, then G x K 2 is 6{G x K 2 ) — 3-fault hamiltonian- 
connected and 5{G x K 2 ) — 2-fault hamiltonian. 

Theorem 2. G(2'",4), m>3, is fully one-to-many disjoint path coverable. 

Proof. The proof is by induction on m. For m = 3, the theorem holds true by 
Lemma 6. For m = 4, by Lemma 6, it is sufficient to show that G(2^, 4) is 0-fault 
one-to-many 3-disjoint path coverable. The proof is mainly by case analysis, and 
omitted here. For m > 5, by Corollary 1 and Lemma 7, G(2’”“^,4) x K 2 is fully 
one-to-many disjoint path coverable, and thus, by Corollary 1 and Lemma 6, 
G(2’”,4) is fully one-to-many disjoint path coverable. □ 

3.2 Twisted Cube TQmi Crossed Cube CQm 

Originally, twisted cube TQm is defined for odd to. We let TQm = TQm-i x K 2 
for even to. Then, TQm is isomorphic to TQm-i ©m TQm-i for some M. Also, 
crossed cube GQm is isomorphic to GQm-i ©m' GQm-i for some M' . Both 
TQm and CQm are m-regular graphs with 2™ vertices. The fault-hamiltonicity 
of them were studied in the literature. 

Lemma 8. (a) TQm, m> 3, is m — 3-fault hamiltonian- connected and m — 2- 
fault hamiltonian[4]. (b) CQm, m>3, is m — 3-fault hamiltonian- connected and 
TO — 2-fault hamiltonian [3] . 

By Corollary 1 and Lemma 7 and 8, we have the following. 

Theorem 3. TQm (ind CQm, rn > 3, are fully one-to-many disjoint path cov- 
erable. 
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3.3 One-to-One Disjoint Path Covers 

Similar to the definition of a one-to-many disjoint path cover, we can define a one- 
to-one disjoint path cover in a graph with faulty elements. In a straightforward 
manner, when f + k < S(G) — 1, we can construct /-fault one-to-one fc-DPC 
of a graph G joining s and t using /-fault one-to-many fc-DPC joining s and 
■ • ■ ,ik}, where ti = t and for every 2 < j < k, tj is a fault-free vertex 
adjacent to t such that is also fault-free. 

Proposition 2. Let G be a graph, and let / > 0. (a) G is f -fault one-to-one 
l-disjoint path coverahle if and only if G is f -fault hamiltonian- connected, (b) G 
is f -fault one-to-one 2-disjoint path coverable if and only if G is f -fault hamilto- 
nian. (c) Let k > 1, and f -\-k < S(G) — 1. LfG is f -fault one-to-many k-disjoint 
path coverable, then G is f -fault one-to-one k-disjoint path coverable. 

Theorem 4. For m > 3, G(2'",4), TQ^, and GQm are f -fault one-to-one 
k-disjoint path coverable for any / > 0 and k >2 such that f -\- k < m — 1. 

It was shown that fault-free G(2’”, 4), to > 3, is (0-fault) one-to-one /c-disjoint 
path coverable for any 1 < /c < to[ 5]. The construction of /-fault one-to-one k- 
DPC’s of G(2’”,4), TQm, and GQm satisfying f -\- k = m remains open. 
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Abstract. This paper proves that for every positive integers n, k and 
any positive constant e, we can explicitly construct a graph G with n -f 
vertices and a constant degree such that even after removing 
any k vertices from G, the remaining graph still contains an n-vertex 
dipath. This paper also proves that for every positive integers n, k and 
any positive constant e, we can explicitly construct a graph H with 
n + 0{k^'^^) vertices and a constant degree such that even after removing 
any k vertices from H, the remaining graph still contains an n-vertex 2- 
dimensional square mesh. 



1 Introduction 

We consider the following problem motivated by the design of fault-tolerant mul- 
tiprocessor systems. Let G be an undirected or directed graph. If G is undirected 
then let V (G) and E{G) denote the vertex set and edge set of G, respectively, 
and (u, v) denote the edge connecting u and v in G. If G is directed then let 
V (G) and A{G) denote the vertex set and arc set of G, respectively, and (u, v) 
denote the arc from m to u in G. L\(G) is the maximum degree of a vertex in 
G. For any S C V{G), G — S' is the graph obtained from G by deleting the 
vertices of S together with the edges incident with the vertices in S. Let /c be a 
positive integer. A graph G is called a fc-FT(fc-fault-tolerant) graph for a graph 
H ii G — F contains H as a subgraph for every F C V{G) with \F\ < fc. If G 
is a DAG (directed acyclic graph), which implies that F[ is also a DAG, then G 
is called a fc-FT DAG for F[. Our problem is to construct a /c-FT graph G for a 
given graph FI such that |Y(G)| and A{G) are as small as possible. 

A large amount of research has been devoted to constructing fc-FT graphs for 
an n-vertex (undirected) path P„ [1,4,5,8-10,13-16], fc-FT DAGs for an n-vertex 
dipath(directed path) [4,6,14-16], and fc-FT graphs for a 2-dimensional m- 
sided mesh M 2 (to) [2-4,7,11,14-16]. 

Among others, Bruck, Gypher, and Ho [4] show a fc-FT DAG for with 
n + 0{k'^) vertices and maximum degree of 4, and a fc-FT graph for M 2 (to) with 
rn? + 0{k^) vertices and a constant maximum degree. Zhang [15, 16] shows a 
fc-FT graph for Pn with n-|-0(fclog^ fc) vertices and maximum degree of 0(1), a 
fc-FT DAG for P„ with n-l-O(fclogfc) vertices and maximum degree of O(logfc), 
and a fc-FT graph for M 2 (to) with w? + 0(fc^/logfc) vertices and 0(log^ fc) 
maximum degree. Zhang [15, 16] also raised the following three open questions: 
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(Ql) Is it possible to construct a fc-FT graph for an n- vertex path P„ with 
n + 0{k) vertices and a constant maximum degree? 

(Q2) Is it possible to construct a /c-FT DAG for an n-vertex dipath with 
n + 0{k log k) vertices and a constant maximum degree? 

(Q3) Is it possible to construct a fc-FT graph for a 2-dimensional m-sided mesh 
M 2 {m) with + 0{k^) vertices and a constant maximum degree? 

Yamada and Ueno [14] settle (Ql) by showing an explicit construction of fc-FT 
graph for with n + 0{k) vertices and maximum degree of 3. 

This paper considers (Q2) and (Q3), and shows the explicit constructions 
of a fc-FT DAG for with n + 0{k^~^'^) vertices and a constant maximum 
degree, and a fc-FT graph for M 2 {m) with -I- 0(fc^“*'®) vertices and a constant 
maximum degree for any constant e > 0. Our construction of fc-FT graphs for 
M 2 {m) is based on the method proposed by [15, 16] combined with our fc-FT 
DAG for P„. 



2 Fault-Tolerant Directed Linear Arrays 



For any positive integer n, let [n] = {0, 1, . . . , n — 1}. The n-vertex dipath P„ is 
the digraph defined as follows: 

V{Pn) = [n]; A{Pn) = {{u,v) :u,vG [n],v = n-|- 1}. 



For any positive integers N, t, and d, we define t as the following DAG: 
V{DN^t,d) = [^]; A{DN,t,d) = {{u,v) : V = n-hP for a: e [d+ 1]}. 
Then, we can prove the following lemma. 

Lemma 1. Let N,t,d,k, and n he positive integers. If k < P and 

n + dk{t — 1) — 1 



N > 



P 



+ 1 



X P 



then Djsi^t.d is a k-FT DAG for P^. 

Proof. Assume without loss of generality that 

n -I- dkft — 1) — 1 



N = 



P 



+ 1 



X P. 



Notice that N/P is a positive integer for any x & [d-l- 1]. Fix any F C V{DM,t,d) 
with jPj < fc. A vertex in F is said to be faulty. We prove that Dff,t,d — F 
contains P„ as a subdigraph. 

For any x G [d+ 1] and y G [t®], define that U{x,y) = {y + jP : j G [N/P]}. 
First, we prove the following claim. 
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Claim 1. There exists a sequence of integers, yo,yi, ■ ■ ■ ,yd, such that yo = 0, 
iVx+i-yx )IC & [t] for any x G [d], and U{x,yx) has at most k/C faulty vertices 
for any x & [c? + 1] . 

Proof of Claim 1 . We prove the claim by constructing the sequence inductively. 

C/(0,j/o) = V{Dn,t,d) has at most k/C = k faulty vertices by assumption. 

Let S G [d], and assume that there exists a sequence of yo,yi, . . . ,ys such 
that yo = 0, {yx+i — yx)/C G [t] for any x G [<5], and U{x,yx) has at most 
k/C faulty vertices for any x G [d + 1]. Since U{S,ys) has at most k/t^ faulty 
vertices and {U{S + l,ys), U{S + l,j /5 + . . . ,U{S + l,ys + {t — 1)^“^)) is a 

partition of U{S,ys), there exists some is+i G [t] such that U{5 + l^ys + is+\t^) 
has at most {k/C)/S = k/C~^^ faulty vertices. Hence, the claim holds by putting 
ys+i = ys + is+it^ ■ □ 

Let yo,yi, ■ ■ ■ ,yd be a sequence of integers satisfying the condition in Claim 
1. For any x G [d+ 1], let kx denote the number of faulty vertices in U{x,yx)- 
By Claim 1, we obtain that kx < k/C. Since kd < k/C < 1, we obtain kd = 0- 
That is, U{d,yd) has no faulty vertices. 

We inductively construct d + 1 fault-free dipaths, denoted by L{d),L{d — 
1), . . . , 1/(0), from yd to yd+ {N ~ as follows: 

(i) Define that L(d) = {yd,yd + f^, ■ ■ ■ ,yd+(N — C)). Then L(d) is a fault-free 

dipath from yd to yd + {N — C) since every vertex of C/(d, yd) = {yd + ■ j G 

[N/C]} is fault-free,. 

(ii) Assume that a fault-free dipath L{x + 1) from yd to yd + {N — C) is 

already constructed. Then, we define L{x) as the dipath obtained from L{x+ 1) 
by replacing each arc {v,v + S A{L{x + 1)) with a dipath L' = {v,v + 

C, . . . ,v+ {t— 1)C, V + if every vertex of L' is fault-free. By the definition 
of L{x), L{x) is a fault-free path from yd to yd + {N — t‘^)- 

We prove that L(0) is a dipath with at least n vertices. Let Tx be the number 
of arcs {v,v + C) G A(L(0)). Since Vx+i < kx — kx+i for any x G [d], we obtain 

|C(L(0))| 

d d 

= {N -d + 1) - rx{C -l)>n + dk{t - 1) - '^{kx-i - kx){C - 1) 

x—1 x—1 

d—1 d—1 

= n + dk{t - 1) - XI kx{C"^^ -C)>n + dk{t - 1) - '^{k/P){P+^ - P) 

a :— 0 a :— 0 

d-1 

= n + dk(t — 1) — X] — 1) = n -I- dkft — 1) — dkft — 1) = n. 

x—0 

Therefore, L(0) is a dipath with at least n vertices. Since every vertex of L(0) is 
fault-free, DNd,d — F contains P„ as a subdigraph, and hence we conclude that 
F>N,t,d is a fc-FT DAG for P„. □ 

Theorem 1. For any positive integers n, k and any constant e > 0, we can 
construct a k-FT DAC for P„ with n+0{k^'^^) vertices and a constant maximum 
vertex degree. 
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Proof. Fix c? 
and 



[1/e], and set t and N as integers satisfying that < k < 

n + dk(t — 1) — 1 



N = 






+ 1 



X 



By Lemma 1, DN,t,d is a /c-FT DAG for P„. By the definition of DN,t,d, the 
maximum degree of a vertex in P)N,t,d at most 2{d+ 1), which is a constant. 
Since t < k^/'^ + 1, we obtain 



N <n + -h +lY = n + 0{k^+Y. 



which completes the proof of the theorem. 



□ 



3 Fault-Tolerant Meshes 

For any positive integers r and c, the 2-dimensional r x c mesh, denoted by 
M(r, c), is the graph defined as follows: 

V{M{r,c)) = [r] x [c]; 

P(M(r,c)) = {([t, j], [/,/]) : It - i'l + \j - f\ = 1}. 

M{m, m) is denoted by M 2 (m), which is called the 2-dimensional m-sided mesh. 



3.1 Basic Construction 



Let N, t, d, r, and c be any positive integers. ^ is the graph defined as follows: 



y{GYt,d) = [^^]; E{G^N:t,d) = U 



where 



Ey,ov{GY^ Y = {{u, u) : |t> - u| = G for x £[d+ 1]} and 
Ever{GY^Y = ■ \v -u\- CG [N - TC + 1]}. 



Lemma 2. If k <f^ and 



N > 



rc + dk(f — 1) — 1 

¥ 



+ 1 



X C 



then G^t d ® k-FT graph for M{r, c). 

Proof. In order to prove the lemma, it suffices to show the following claim: 

Claim 2. For any F CV {G^t d) 1^1 — there exist rc vertices uq, fi, . . . , 
Vrc-i € V{G'^Yt d) ~ E satisfying the following three conditions: 

1. uo < ui < • • • < frc-i; 

2. {vs,,Vsi;+i) G Ghor(G[/r']t_^) for every a; G [rc - 1]; and 

3. {vx,Vx+c) G E^eYGY'd.d) for every a: G [rc - cj. 
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Proof of Claim 2. By Lemma 1, there exist rc vertices vo,vi, . . . ,Vrc-i G 
V d) ~ ^ satisfying conditions 1 and 2. Since 

~{vx--x€ [rc]}| = N -rc, 

we obtain that c < Vx+c — Vx < c + fV — rc for every x G [rc — c], and so 

{Xx+c,Xx} € ^'ver{G D 

By Claim 2, d~ ^ contains M(r, c) as a subgraph for any F C 

with |F| < k, and hence G^f^^ d is a fc-FT graph for M{r,c). □ 

By Lemma 2, we have the following theorem. 

Theorem 2. For any positive integers r, c, and k, and any constant e > 0, we 
can construct a k-FT graph for M{r, c) with rc+0{k^~^^) vertices and maximum 
vertex degree of 0{k^'^'^). 

3.2 Tower Graphs 

For any positive integer h, the tower graph Q{h) of height h is the graph defined 
as follows: 

V{Q{h)) = {[l,p\-.lG[h + l],pG[2’^-% 

E{Q{h)) = Et,{Q{h)) U Gip(Q(/i)), 

where 



EtriQih)) = {{[l,p], [I +l,q\) : q= \p/2\} and 
E\p{Q{h)) = {{[l,p], [l,q]) :q={p±l) mod 2'*"'}. 

I and p are called the level and position of a vertex [l,p], respectively. An edge 
in Et^{Q{h))[Eip{Q{h))] is called a tree[loop] edge, [/i, 0] and [0,p] are called a 
root and leaf of Q{h), respectively. It is easy to see that \V{Q{h))\ = 2^+^ — 1 
and A{Q{h)) < 5. Fig.l shows Q(3). 





Fig. 2. Hamilton Path H{3, 4) in Q(3) 



For any positive integer h, p G [2^], and I G [h + 1], wi{h,p) is defined as 
follows: Wo{h,p) = p] wi+\{h,p) = [{wi{h,p) — 1)/2J for any I G [h]. For any 
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p G [2^], let H{h,p) denote the Hamilton path of Q{h) from [0,p] to [h, 0] defined 
as follows: Iterate the following through I = 0 to h — 1: 

— Traverse loop edges in the order of [I, wi{h,p)], [I, wi{h,p) + l],. . . ,[l, 2^“^ — 1], 

[l,wi{h,p) - 1]; 

— Go up from [l,wi{h,p) — 1] to [I + 1, using a tree edge. 

Fig. 2 shows H(3,4). It is easy to see that wo(3,4) = 4, wi(3,4) = 1, W2(3,4) = 0, 
and ^3(3, 4) = 0. See Fig. 2. 

For any positive integer i < 2^"’'^ — I, let s{h,p, i) denote the f-th vertex from 
[0,p] in H{h,p). The following lemma is proved in [15,16]. 

Lemma 3. [15,16] For any p,q € [2^], p < q, and any positive integer i < 

2^+1 _ s{h,p,i) and s{h,q,i) are vertices on the same level I, and 



{n{h,q,i) — Tr{h,p, i)} mod 2^ * 



q-p 

2 ' 



e[2], 



where T:{h,p,i) and T:{h,q,i) are the positions of s{h,p,i) and s{h,q,i), respec- 
tively. 



3.3 Improved Construction 



In this subsection, for any positive integers to, k, and d, we construct a graph 
Gk.di'rn), which is shown to be a fc-FT graph for M2 (to) with to^ -I- 
vertices and maximum degree of 0{d^). 

Set t as an integer with (t — lY < k < and h = \log{dk{t — 1) + 2t‘^)~\ . 
For simplicity, we assume that 2^+^ — 2 is a divisor of to. Let r = — 2), 



c = m, and 



N = 



rc dk{t — 1) — 1 
td- 



+ 1 



X r 



Notice that N — rc 1 < dk{t — 1) -|- 2f^ < 2^. 

We define Q{h) as the graph obtained from two copies, Q^{h) and Q^{h), of 
Q{h) by connecting a root of Q^{h) and that of Q^{h) by an edge. Fig. 3 shows 
Q(3). For any p,q & [2^], let H^{h,p) and H^{h,q) denote the Hamilton paths 
of Q^{h) and Q^{h) corresponding to H{h,p) and H{h,q) in Q{h), respectively. 
H{h,p,q) is the Hamilton path of Q{h) obtained from H^{h,p) and H^{h,q) by 
connecting a root of Q^{h) and that of Q^{h) by an edge. Let s'{h,p, q, i) denote 
the f-th vertex from [0,p] G V{Q'^{h)) in H{h,p,q). 

For any positive integers t, d, and I, define that 



St,d = {1° -t^ : 0 < a < P < d} and St,d,i = | ^ + 6 : a; G St^d, b€ [2] | . 



Qk,d{'m) is the graph obtained from ^ as follows: 

(Cl) Replace each vertex v of ^ with a copy of Q{h)\ 




408 



Toshinori Yamada 





(C2) If (u,v) G ^hor(G^'^j^) then connect each vertex [l,p] in Q^lQl,] with 
[I, {p±S) mod 2^“'] in Ql[Ql] by an edge for every 5 G St,d,u where Q^lQl] 
represents a copy of Q{h) in Qy corresponding to in Q{h)] 

(C3) If (u,v) G and u < v then connect a vertex [0,p] in Q]^ with 

a vertex [0,p] in where p = {v — u) — c. 

It is easy to see the following two lemmas. 

Lemma 4. \V{Qk,d (m))| = w? + 

Lemma 5. A{Qk,d{'m)) = 0{(P). 

Lemma 6. Gk,d{'rn) is a k-FT graph for M 2 {m). 

Proof. Fix any F C V{Gk,d{'m)) with |F| < k. Let F' = {v : Qy has a vertex in 
F}. Since |F'| < k, there exist rc vertices vq, ■ . ■ ,Wrc-i G j) — F' satis- 

fying the conditions in Claim 2. For any a; G [r -I- 1] and y G [c], let 



^x.y — 



0 

{Py+xc ’^y+{x — l)c) ^ 



if a; = 0 or X = r, 
otherwise. 



We define a mapping (f : V{M 2 {m)) V{Gk,d{'m)) — F as follows: For any 

G C(M2(m)), 

(j){[mi,m2]) = s'y^^^^{h,Za;,y,Zy+i^y,i), 

where s'y{h,p,q,i) is the vertex in Qy corresponding to s' {h,p,q,i) in Q{h), 
X = [toi/( 2^+^ — 2)J, i = mi mod (2^+^ — 2), and y = m 2 . 



Claim 3. For any mi G [m— 1] and m 2 G [m], ^([mi, m 2 ]) is adjacent to ^([mi -I- 

l,m2]). 
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Proof of Claim 3. Let a; = — 2)J , i = toi mod (2^+^ — 2), and y = TO 2 . 

If i ^ 2'*+2 - 3 then 

+ l,m2]) = z^^y, z^+i^y,i + 1)- 

Therefore, 4>{[mi,m2\) is adjacent to 4>{[mi + 1,TO2]). 

If i = 2^+2 - 3 then 

(j}{[mi,m2\) = Zo,,y, z^+i^y,2^+'^ - 3) 

(filmi + I, m 2 ]) = s'^^^^^^^^^{h,Zoo+i,y,z^+2,y,0). 

That is, i^([toi,TO 2 ]) is a leaf [0,Zx+i,y] in and 4>{[mi + l,m 2 ]) is a leaf 

[0,Zx+i,y] in Since Zx+i,y = {vy+(^+i)c- Vy+^c) ~ c, we conclude that 

4 >{[mi,m 2 ]) is adjacent to 4>{[mi + l,m 2 ]). □ 

Claim 4- For any m\ G [m] and m 2 G [to — 1], 4>{[mi,m2]) is adjacent to 

(/)([toi,TO2 + 1]). 

Proof of Claim 4- Let a; = [toi/( 2'*+^ — 2)J , i = to-i mod (2^*+^ — 2), and y = to- 2 . 
Then, 

(/)([toi,TO 2 ]) = s'^^^^^{h,Zoo,y,z^+i^y,i) and 
^([toi,TO2 + 1]) = s'^^^^^_^^{h,z^,y+i,z^+i^y+i,i). 

Assume that i < 2^+^ — 2. Then, 

i^([toi,TO 2 ]) = s°^_^^^{h,z^^y,i) and ^([toi,TO 2 + 1]) = s°^^^^_^^{h,z^^y+i,i). 

We consider two distinguished cases, (1) a; = 0 and (2) a; yf 0. 

(1) If a; = 0 then 

(/)([toi, TO 2 ]) = {h, 0, i) and (/)([toi, TO 2 + 1]) = {h, 0, i). 

That is, <j){[mi,m 2 \) and (j){[m\,m 2 + 1]) are vertices of and Q2 „+i corre- 
sponding to the same vertex of Q{h), respectively. Since (vy,Vy+i) G Ahor(G[v^t d) 
and 0 G St^d, we conclude that i^([toi,TO 2 ]) is adjacent to 0 ([toi,to-2 + 1]). 

(2) Assume that a; yf 0. Then, Za;,y = {vy+xc ~ Vy+(^x-i)c) ~ c and Zx,y+i = 

i'^y+xc+i ^y+{x—i)c+i) c. Since (^Vy_^(^x—i)cT'^y+{x—i)c+i) C ^hori^G and 
{'^y+xcj '^y+xc+i) G Gj\iqi{^G ^ ^ ^ we obtain Vyj^(^x—i)c+i '^y+(x—i)c ~ ^ and 

Vyj^xc+i — Vy+xc = foc somo a , (3 G [d -I- 1]. Hence, 

^x^y+l ^x^y — (^y-t-a:c-t-l '^y+xc) (Py+(x—l)c+l '^y+{x — l)c) — f ■ 

Assume without loss of generality that P > a. Then, by Lemma 2, Zx,y, 

i) and Zx,y+ic) are the same position I in and re- 

spectively, and 
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that is 

T^Vy+,,+ Ah,Za^,y+l,i) 



^Vy+,A^,Za:,v,i) + 



-r 




mod 2^ ’■ 



for some b € [2], where z^;^y,i) and (/i, t) are the posi- 
tions of s°^^^^{h,Za;,y,i) and z^;^y+i,i), respectively. Since - t°‘ e 

St,d, we conclude that (j>{[mi,m 2 ]) is adjacent to -I- 1]). 

By a similar argument, we can also prove the claim when 2^+^ — 1 < z < 

2 h +2 _2 □ 

By Claims 3 and 4, Gk,d{'m) — F contains M 2 {m) as a subgraph, and hence 
Gk,d{iTi) is a fc-FT graph for M 2 {m). □ 



Next, we describe the construction of a /c-FT graph Gk,d(jn) for M 2 {m) in 
the case when 2^+^ — 2 is not a divisor of m. 

(C’l) Set r = [m/(2^+^ — 2)J and c = m. Perform (Cl), (C2), and (C3). 

(C’2) For each v > (r — l)c, add 7 = TV — rc(2^+^ — 2) -|- 1 vertices, 0^4, a„, 2, ■ • • , 
a„,T,, and 7 -I- 1 edges, (a„,o, a«,i), (0^,7, fli,, 7+1), to in Gk,d{'^')j 

where and a„_7+i are a root of and that of Q^, respectively (See 
Fig.4); 

(C’3) If (u,v) G and u,v > {r - l)c then add 7 edges, (au,x,av,x) 

{x = 1,2,..., 7), to the graph obtained in (C’2). 

Lemmas 4, 5, and 6 hold even in the case when 2^+^ — 2 is not a divisor of m 
by a similar argument as the proofs in the case when 2^+^ — 2 is a divisor of m. 
Hence, we have the following theorem. 

Theorem 3. Gk,d{'m) is a k-FT graph for M 2 {m) with \V{Gk,d{iTi))\ = mf + 
and A{Gk,d{rn)) = O(d^). 

Theorem 4. For any positive integers m, k and any constant e > 0, we can 
construct a k-FT graph for M 2 {m) with n + 0{k^~^^) vertices and a constant 
maximum vertex degree, where n = |y(M2(m))| = . 

Proof. The theorem follows from Theorem 3 by putting d = [2 /e~\ . □ 



4 Concluding Remarks 

Since a fc-FT graph Vn,k for an n-vertex path P„ with n + 0{k) vertices and 
maximum degree of 3 is shown in [14], it is possible to construct a /c-FT graph for 
M 2 {m) with mf + 0{k^) vertices and a constant maximum degree if A: = f2{m). 
In fact, the following graph Gf^{rn) is a desired graph: 

ViG'kim)) = V{Vm,k) X 

E{G'k{m)) = {([mi,w], [u2,f]) : (^1,^2) G P(Pm.fe)} 

U {([zz,z;i], [u,V2]) : (^1,^2) G E{Pm)}- 

However, it still remains open whether it is possible to construct a fc-FT graph 
for M2 (to) with mf + O(k^) vertices and constant maximum degree for any 
k = o{m). 




Fault-Tolerant Meshes with Constant Degree 



411 



References 

1. N. Alon and F. Chung. Explicit construction of linear sized tolerant networks. 
Discrete Math., 72:15-19, 1988. 

2. J. Brack, R. Cypher, and C. Ho. Fault-tolerant meshes and hypercubes with min- 
imal numbers of spares. IEEE Trans, on Comput., pp. 1089-1104, 1993. 

3. J. Bruck, R. Cypher, and C. Ho. Tolerating faults in a mesh with a row of spare 
nodes. Theoretical Computer Science, 128(l-2):241-252, June 1994. 

4. J. Bruck, R. Cypher, and C. Ho. Fault-tolerant meshes with small degree. SIAM 
Journal on Computing, 26:1764-1784, 1997. 

5. S. Dutt and J. Hayes. Designing fault-tolerant systems using automorphisms. Jour- 
nal of Parallel and Distributed Computing, 12:249-268, 1991. 

6. P. Erdos, R. Graham, and E. Szemeredi. On sparse graphs with dense long paths. 
Comp, and Math, with AppL, 1:145-161, 1975. 

7. A. Farrag. New algorithm for constructing fault-tolerant solutions of the circulant 
graph configuration. Parallel Computing, 22:1239-1254, 1996. 

8. F. Harary and J. Hayes. Node fault tolerance in graphs. Networks, 27:19-23, 1996. 

9. J. Hayes. A graph model for fault-tolerant computing systems. IEEE Trans, on 
Comput., C-25:875-883, 1976. 

10. M. Paoli, W. Wong, and C. Wong. Minimum k-hamiltonian graphs, II. J. Graph 
Theory, 10:79-95, 1986. 

11. H. Tamaki. Construction of the mesh and the torus tolerating a large number of 
faults. In Proc. 6th Annual ACM Symposium on Parallel Algorithms and Arehitec- 
tures (SPAA ’94), pp. 268-277, 1994. 

12. S. Ueno, A. Bagchi, S. Hakimi, and E. Schmeichel. On minimum fault-tolerant 
networks. SIAM J. on Discrete Mathematics, 6(4):565-574, November 1993. 

13. W. Wong and C. Wong. Minimum k-hamiltonian graphs. J. Graph Theory, 8:155- 
165, 1984. 

14. T. Yamada and S. Ueno. Optimal fault-tolerant linear arrays. In Proe. 15th ACM 
SPAA, pp. 60-64, 2003. 

15. L. Zhang. Fault tolerant networks with small degree. In Proeeedings of Twelfth an- 
nual ACM Symposium on Parallel Algorithms and Architectures, pp. 64-69, 2000. 

16. L. Zhang. Fault-tolerant meshes with small degree. IEEE Transactions on Com- 
puters, 51(5):553-560, May 2002. 




Fault Hamiltonicity of Meshes 
with Two Wraparound Edges* 



Kyoung-Wook Park^, Hyeong-Seok Lim^, 

Jung-Heum Park^, and Hee-Chul Kim^ 

^ Department of Computer Science, Chonnam National University, 
300 Yongbong-dong, Buk-gu, Gwangju 500-757, Korea 
kwparkScsblue . chonnam . ac . kr , hslimOchonnam . ac . kr 
^ School of Computer Science and Information Engineering, 

The Catholic University of Korea 
j .h. park® catholic . ac .kr 

® School of Computer Information and Communications Engineering, 
Hankuk University of Foreign Studies 
hckimShuf s . ac . kr 



Abstract. We consider the fault hamiltonian properties of m x n meshes 
with two wraparound edges in the first row and the last row, denoted 
by M 2 {m,n), m > 2, n > 3. M 2 {m,n) is a spanning subgraph of Pm x 
Cn which has interesting fault hamiltonian properties. We show that 
M2 (m, n) with odd n is hamiltonian-connected and 1-fault hamiltonian. 
For even n, M 2 (m,n), which is bipartite, with a single faulty element is 
shown to be 1-fault strongly hamiltonian-laceable. In previous works[l, 
2], it was shown that Pm x Cn also has these hamiltonian properties. Our 
result shows that two additional wraparound edges are sufficient for an 
m X n mesh to have such properties rather than m wraparound edges. 
As an application of fault-hamiltonicity of M 2 {m,n), we show that the 
n-dimensional hypercube is strongly hamiltonian laceable if there are at 
most n — 2 faulty elements and at most one faulty vertex. 



1 Introduction 

Meshes represent the communication structures of many applications in scien- 
tific computations as well as the topologies of many large-scale interconnection 
networks. One of the central issues in parallel processing is embedding of linear 
arrays and rings into a faulty interconnection network. The embedding is closely 
related to a hamiltonian problem in graph theory. 

An interconnection network is often modeled as an undirected graph, in which 
vertices and edges correspond to nodes and links, respectively. A graph G is called 
hamiltonian-connected if there exists a hamiltonian path joining every pair of ver- 
tices in G. We consider the hamiltonian properties of a graph in the presence of 
faulty elements (vertices and/or edges). A graph G is called k-fault hamiltonian 

* This work was supported by grant No. ROl-2003-000-11676-0 from the Basic Re- 
search Program of the Korea Science & Engineering Foundation. 
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(resp. k-fault hamiltonian- connected) if G — F has a hamiltonian cycle (resp. a 
hamiltonian path joining every pair of vertices) for any set F of faulty elements 
such that jT’l < k. Apparently, a bipartite graph is not hamiltonian-connected. 
In [3], the concept of hamiltonian laceability for hamiltonian bipartite graphs 
was introduced. Bipartition sets of a bipartite graph are often represented as 
sets of black and white vertices. A bipartite graph G is hamiltonian-laceahle if 
there is a hamiltonian path joining every pair of vertices with different colors. In 
[4], this concept was extended into strongly hamiltonian laceability. A hamilto- 
nian laceable graph G with N vertices is strong if there is a path of length N — 2 
joining every pair of vertices with the same color. 

For any faulty set F such that |F| < k, a bipartite graph G which has an 
L°P*-path joining every pair of fault-free vertices is called k-fault strongly hamil- 
tonian laceable[2]. An L°P‘-path is defined as follows. Let G be a bipartite graph 
and let B and W be the sets of black and white vertices in G, respectively. De- 
note by Fy and Fe the sets of faulty vertices and edges in G, respectively. Let 
F = Fy U Fe, fy = |F„|, fe= \Fe\, and / = \F\. The numbers of fault-free black 
and white vertices are denoted by Ub and n^, respectively. When nt = n^, a 
fault-free path of length 2rib — 1 joining a pair of vertices with different colors is 
called an L°'^^-path. For a pair of vertices with the same color, a fault-free path 
of length 2ub — 2 between them is called an L°^^-path. When nt > n^, fault-free 
paths of length 2riw for a pair of black vertices, of length 2uw — 1 for a pair of 
vertices with different colors, and of length 2riyj — 2 for a pair of white vertices, 
are called L°P^-paths. Similary, an L°P*-path can be defined when > rib- A 
fault-free cycle of length 2-min{nh, n^.} is called an L°P^-cycle. The lengths of an 
L°P*-path and an L°P*-cycle are the largest possible. 

Fault hamiltonicity of various interconnection networks has been investi- 
gated. In [5] and [6], linear-time algorithms that find hamiltonian paths in mx n 
meshes were developed . In [7] and [8], the fault hamiltonian properties of mx n 
torus and Pm x Cn were considered, where Pm x G„ is a graph obtained by 
product of a path Pm with m vertices and a cycle G„ with n vertices. Pm x G„ 
forms an m X n mesh with a wraparound edge in each row. Futhermore, it was 
shown that Pm x G„ is hamiltonian-connected and 1-fault hamiltonian if it is 
not bipartite [1]; otherwise. Pm x G„ is 1-fault strongly hamiltonian laceable [2]. 

In this paper, we consider the hamiltonian properties of m x n mesh (m > 
2,n > 3) with two wraparound edges in the first row and the last row. We de- 
note the graph by M 2 (m, n). We show that M 2 (to, n) with odd n is hamiltonian- 
connected and 1-fault hamiltonian. For a graph G to be hamiltonian-connected, 
G should be non-bipartite and 5{G) > 3, where 5{G) is the minimum degree of G. 
For a graph G to be /c-fault hamiltonian, it is necessary that k < 6{G) — 2. Thus, 
M 2 {m,n) with odd n satisfies the above condition by adding two(minimum) 
edges to an TO X n mesh. Futhermore, for n even, we show that M 2 (to, n), which 
is bipartite, with a single faulty element is strongly hamiltonian laceable. In 
previous works [1,2], it was shown that Pm x G„ also has these hamiltonian 
properties. Our result shows that two additional wraparound edges are sufficient 
for an TO X n mesh to have such properties rather than to wraparound edges. 
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Fig. 1. Examples of M\(rn, n) and Ms(m, n) 



For some m and n, M2{m,n) is a spanning subgraph of many interconnec- 
tion networks such as tori, hypercubes, k-a,ry n-cubes, double loop networks, and 
recursive circulants. Thus, our results can be applied to discover the fault hamil- 
tonicity of such interconnection networks. It was shown in [ 9 ] that n-dimensional 
hypercube Qn with n — 2 faulty edges is strongly hamiltonian laceable. By ap- 
plying fault hamiltonicity of M2(m,n), we show that Qn with at most n — 2 
faulty elements and at most one faulty vertex is strongly hamiltonian laceable. 

2 Preliminaries 

Let M{m,n) = (V,E) be an m x n mesh, where the vertex set V is < 

i < m,l < j < n} and the edge set E is < i < m,l < j < 

n} U {(ri ], < i < m,l < j < n\. We propose a graph which has two 
wraparound edges in the first row and the last row in M{m,n). 

Definition 1. Let M{m,n) = (V,E). M2{m,n) is defined as {Vm 2, EM2), where 
the vertex set Vm2 = P o,nd the edge set Em2 = EU {("yi , (v^,v^)}. 

The vertices of M2{m,n) are colored with black and white as follows: is 

called a black vertex ii i + j is even; otherwise it is a white vertex. We denote by 
R{i) and C{j) the vertices in row i and column j, respectively. That is, R{i) = 
j < n} and C{j) = {fjll < i < m}. We let R{i : j) = Ui<k<jR{k) if 
i < j; otherwise R{i : j) = 0 . Similarly, C{i : j) = Ui<k<jC{k) if i < j; otherwise 
C{i : j) = 0 . 

We denote by H[s,t\X] a hamiltonian path from s to t in the subgraph 
G {X) induced by a vertex set X, if any. A path is represented as a sequence of 
vertices. If X is an empty set, iL[s, tlX] is an empty sequence. We denote by Vj 
v], a path (uj, uj+i, • • • , v],_^,v],) if j < /; otherwise, 

Similary, u* — > a path from ri* to v'- in the subgraph G {C{j)). We employ 

three lemmas on the hamiltonian properties of M{m,n) and Pm x Gn- We call 
a vertex in a mesh a corner vertex if it is of degree two. 

Lemma 1. [ 10 ] (a) If mn is even, then M(m,n) has a hamiltonian path from 
any corner vertex v to any other vertex with color different from v. (b) If mn 
is odd, then M(m,n) has a hamiltonian path from any corner vertex v to any 
other vertex with the same color as v. 
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Lemma 2. [5] Let two vertices s,t have different color each other, (a) Ifm,n > 
4 and mn is even, then M{m,n) has a hamiltonian path joining s and t. (b) If 
m = 2, n > 3, and s ,t ^ C {k){2 < k < n — 1) , then M{m, n) has a hamiltonian 
path joining s and t. 

Lemma 3. (a) For m > 2,n > 3 odd, Pm x (7„ is hamiltonian- connected and 
1-fault hamiltonian [1]. (b) For m > 2,n > 4 even. Pm x Cn is 1-fault strongly 
hamiltonian-laceable[2] . 

Let P and Q be two vertex-disjoint paths (ai, 02, • • • , Ofc) and (61, 62, • • • ,6;) 
in a graph G, respectively, such that (oj, 61) and (fli+i, 6;) are edges in G. If we 
replace (oj, Ui+i) with (oj, bi) and (ui+i, 6/), then P and Q are merged into a sin- 
gle path (oi, 02, • • • , Oi, 61, 62, • • • ,bi, Oi+i, • • • , Ofe). We call such a replacement 
a merge of P and Q w.r.t. (oi,6i) and (oi+1,6/). If P is a closed path(that is, a 
cycle), the merge operation results in a single cycle. We denote by V (P) the set 
of vertices on a path P. 

To show the fault hamiltonicity of M2 (to, n), we first show some hamiltonian 
properties of Mi(m,n) which has a single wraparound edge on the first row in 
M{m,n). An Mi{m,n) has two corner vertices and v'^. 

Lemma 4. For m > 2, n > 3 odd, Mi{m,n) has a hamiltonian path between 
any corner vertex s and any other vertex t. 

Proof. The proof is by induction on to. Without loss of generality, we assume 
that s = wj". First, we observe that the lemma holds for to = 2. For t € B, there 
exists a hamiltonian path by Lemma 1; otherwise we can construct a hamiltonian 
path P = (s,vl, H[vl„t\G(2 : n)]). By Lemma 1, H[vf_,t\G{2 : n)] exists. 

For TO > 3, we assume that the lemma is true for every k < m. The proof is 
divided into two cases. 

Case 1: t G R{1 : m — 1). When t , we can construct a hamiltonian 

path P = {s,vlff u™, H[v'ff~^ ,t\R{l : to — 1)]). By induction hypothesis, 
H[vff-^ ,t\R{\ : TO- 1)] exists. When t = , P = (s, v^~^ vl,H[s', t\G{2 : 

n)]), where s' is v^^ if to is odd; s' is v\ if to is even. 

Case 2: t G R{m). Let t = v™ . By induction hypothesis, there exists a hamil- 
tonian path P' joining and in G(i?(l : to — 1)). We construct a 

hamiltonian path P = (s, vff vff^,P' , v'ff , vff_Y ^ t). □ 

In the following lemmas, we just summarize the results and omit the proofs. 
Lemma 5. For to > 2, n > 4 even, Mi{m,n) is strongly hamiltonian laceable. 

Lemma 6. For to > 2,n > 4 even, Mi{m,n) with a single faulty vertex Vf has 
an L°'P*-path joining a corner vertex s and any other vertex t if s has a different 
color from Vf and at most one of Vf and t is adjacent to s. 

3 Hamiltonian Properties of M 2 {m,n) 

3.1 M 2 {m,n) with Odd n 

When n is odd, M2 (to, n) is not bipartite. We show that M2 (to, n) is hamiltonian- 
connected and 1-fault hamiltonian. 
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Theorem 1. For m > 2,n > 3 odd, M 2 {m,n) is hamiltonian- connected. 

Proof. The proof is by induction on m. M2(2, n) is isomorphic to P 2 x Cn. Thus, 
the theorem is true by Lemma 3 when m = 2. For m > 3, we assume that the 
theorem is true for every k < m. Let s = vf, t = uj. We show that M 2 {rn,n) 
has a hamiltonian path P between s and t. The proof is divided into two cases. 
Case 1: s,t G i?(l : to — 1). If we assume that a virtual edge 
exists, then there exists a hamiltonian path P' joining s and t in G (i?(l : to — 1)) 
by induction hypothesis. If P' passes through the edge then we 

can construct a hamiltonian path P by replacing with a path 

^ otherwise we choose an edge (u,v) in G {R{m — 1)) 

such that P' includes it. Let u' and v' be the vertices in R{m) adjacent to u 
and V, respectively. Since {R{m)) forms a cycle, it has a hamiltonian path P” 
joining u' and v' . By a merge of P' and P" w.r.t. (u,u') and (v,v'), we have a 
hamiltonian path. 

Case 2: s G P(1 : to — 1) and t G R{m). When s G R{2 : to — 1), this case is 
symmetric to Case 1. Thus, we only consider the case that s G P(l). 

Case 2.1: m = 3. If either s or t is on the first column or the last column, 
then there exist a hamiltonian path by Lemma 4. Otherwise(that is, s,t G G(2 : 
n — 1)), by Lemma 1 and Lemma 2, we can construct a hamiltonian path P as 
follows: 

(i) s,t G W, P = {s,v}_^_^ v^,vl vl_j^,H[vf_j^,vf\R{2 : 3)] C G(1 : j - 
l)],H[vlt\R{2:3)nCU-.n)]). 

(ii) s G P and t G W, P = {H[s, u?|G(l : i)],P[[vf^, t\C{i + 1 : n)]). 

(hi) s,t G B, 

P=/ iH[s,v}^^\C{l :i+l)],H[v}^.^,t\C{i + 2:n)]) i j 

\ (s,P[u|_i,u,Li|G(l : i - l)],vl + I : n)],t) if i = j 

Case 2.2: to > 4. By Lemma 4, there exist two paths P' = {H[s,v^~^\R{l : 
TO — 2)]) and P" = t|P(TO — 1 : to)]). (P',P") forms a hamiltonian 

path. □ 

Theorem 2. For to > 2,n > 3 odd, M 2 {m,n) is 1-fault hamiltonian. 

Proof. We prove by induction on to. Due to Lemma 3, the theorem holds for 
TO = 2. For TO > 3, we assume that the theorem is true for every fc < to, and we 
consider M 2 (to, n). Without loss of generality, we assume that the faulty element 
is contained in G (P(l : to — 1)). 

If we assume that a virtual edge (u]"“^,u™“^) exists, then there exists a 
fault-free hamiltonian cycle G' in G(P(1 : to — 1)) by induction hypothesis. If 
G' passes through u™“^), then we can construct a hamiltonian cycle 

G by replacing with a path ^ If G' does 

not pass through we choose an edge (u,v) in G {R{m — I)) such 

that G' includes it. Let u' and v' be the vertices in P(to) adjacent to u and v, 
respectively. Since G (P(to)) forms a cycle, it has a hamiltonian path P' joining 
u' and v' . By a merge of C' — (u,v) and P' w.r.t. {u,u') and we have a 

fault-free hamiltonian cycle G. □ 
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3.2 M 2 {m,n) with Even n 

When n is even, M 2 {m,n) is bipartite. First, we show that M 2 {m,n) with a 
single faulty vertex is strongly hamiltonian-laceable. 

Lemma 7. For n > 4 even, M2(3,n) with a single faulty vertex is strongly 
hamiltonian laceable. 

Proof L°P*-paths can be constructed for all cases: i) s,t G R{1 : 2), ii) s G R{1 : 
2),t G i?(3), iii) s,t £ i?(3). The details are omitted. □ 

Lemma 8. For m > 2,n > 4 even, M 2 {m,n) with a single faulty vertex is 
strongly hamiltonian laceable. 

Proof. The proof is by induction on m. For m = 2 and m = 3, the lemma is true 
by Lemma 3 and Lemma 7, respectively. For m > 4, we assume that the lemma 
holds for every k < m, and we consider M 2 {m,n). Let s = vf, t = Vj. Without 
loss of generality, we assume that a faulty vertex Vf £W and Vf G R{1 : m — 2). 

Case 1: s,t G i?(l : m — 1). Similar to Case 1 in Theorem 1, we can constructed 
an L°P*-path. 

Case 2: s G i?(l : m — 1) and t G R{m). 

Case 2.1: s £ R{1 : m — 2). We choose a black vertex s' which is one of the two 
vertices and If s = s' or s' is adjacent to both Vf and s, then let 

s' be the black vertex in R{m — 2) n C(2 : n — 1). There exists an L°P*-path P' 
joining s and s' in G (i?(l : m — 2)) as follows. When either s or s' is u™“^(resp. 

and m is even(resp. odd), P' exists by Lemma 6. Otherwise P' can be 
constructed as follows: 

{ {H[s,vl,_i\R{l : m — 3) n C{n — 1 : n)], 

p[[v\_ 2 , : TO — 2) n C(1 : n — 2)]) if to is even 

: TO-3)nC(l : 2)], 

iL : TO — 2) n C(3 : n)]) if to is odd 

Let t' be the vertex in R{m — 1) adjacent to s' . By Lemma 5, there exists an 
L°P*-path P" joining t' and t in G {R(jn — 1 : to)). P' and P" form an L°P‘-path. 

Case 2.2: s G R{m — 1). If Vf is in R{2 : m — 2), then this case is symmetric 
to Case 1. Thus, we only consider the case that Vf £ R{1). We choose two 
vertices u and v as follows. When s,t £ B, let u = vff~‘^ and v = if to is 
even; otherwise u = v'f'~'^ and v = vffZ\ . When at least one of s or t is white, 
u = and v = vff~'^ . Let v! and v' be the vertices in R{m — 1) adjacent 

to u and v, respectively. By Lemma 6, there exists an L°P*-path P' joining 
u and V in G {R{1 : m — 2)). Let P" and P'" be two vertex-disjoint paths in 
G{R{m— 1 : to)) such that V{P") U V{P'") = R{m — 1 : to) and they are 
joining s and u' , v' and t, or s and v', v! and t, respectively. We can construct 
an L°P‘-path P = (P" , P' , P'"). P" and P'" can be constructed as follows: 
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Without loss of generality, we assume that m is even. 

Case 2.2.1 i < j. (i) s,t G B, 

Us,u') iftec(2) 

1 u'\C{l : i) n R{m — 1 : m)]) if t € C(3 : n) 

I {H[v' : n) n R{m — I : m)],v'^,t) if t € C(2) 

^ = I (skflco: + 1 : n) n ii(m - 1 : .’)1) if * £ CP : ») 

(ii) s GW and t G B, P" = {H[s, u'\C{l : i) n R{m — 1 : m)]) and 
P'" = {H[v' , t\C{i + 1 : n) n R{m — 1 : m)]). 

(iii) s G B and t S W, P" = (s, ^ 

P'" = (ff[v',v^lC(j + 1 : n)nR(m-l : m)],vY^ , H[v^^,t\C(i + I : 

j) n R{m — 1 : m)]). 

(iv) s,t GW. P” = (i^ [s, u'|C(l : j — 1) n R{m — 1 : m)]) and 

P'” = + 1 : n) n R{m — 1 : m)],t). 

Case 2.2.2 i > j. Similar to Case 2.2.1, P" and P'" can be constructed. The 
details of P" and P'” are omitted. 

Case 3: s,t G R{m). If Vf is in R{2 : m — 2), then this case is symmetric 

to Case 1. Thus, we only consider the case that Vf G The same way as 

Case 2.2, we can construct an L°P*-path P except the case that s G B and 
t G W . We only show the case that s G B and t G W. An L°P*-path P' in 
G (i?(l : m — 2)) can be obtained by using the same way as Case 2.2, and two 
vertex-disjoint paths P” and P'” in G {R{m — 1 : to)) can be constructed as 
follows: P” = {P[[s,vJ‘~i\G{i : j — 1) C R{m — 1 : m)],vj^~^ v') and P'" = 

(i7K,uj"|C'(l : t - 1) n R{m - 1 : to)],< ^ t). (P",P',P'") forms an L°p‘- 
path. □ 

Lemma 9. For to > 2, n > 4 even, M 2 (to, n) with a single faulty edge is hamil- 
tonian laceable. 

Proof. We prove by induction on to. Due to Lemma 3, the lemma holds for 
TO = 2. For TO > 3, we assume that the lemma is true for every k < m, and we 
consider M 2 {m,n). There exists a hamiltonian path by Lemma 2, if e/ is one 
of {v\,v^) and Let s = vf and t = Vj . Without loss of generality, we 

assume that the faulty edge Cf G G (i?(l : to — 1)). 

Case 1: s,t G R{1 : to — 1). Similary to Case 1 in Theorem 1, we can construct 
an L°P* path. 

Case 2: s G i?(l : to — 1) and t G R{m). 

Case 2.1: e/ = {vf,,vp~^). When x > z, there exists a hamiltonian path P' 
joining s and t in G {R{z + 1 : to)) by Lemma 5. We choose an edge (u,v) in 
G {R{z+ 1)) such that P' includes it and neither u nor v is Let u' and 

v' be the vertices in R{z) adjacent to u and v, respectively. By Lemma 5, there 
exists a hamiltonian path P” joining u and u in G (i?(l : z)}. By a merge of P' 
and P" w.r.t. {u,u') and (v,v'), we have a hamiltonian path P. When x < z, 
we choose a vertex s' in R{z) such that s' has a different color from s and is 
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not v^. Let t' be the vertex in G {R{z + 1)) adjacent to s' . By Lemma 5, we can 
construct a hamiltonian path P = {H[s, t\R{z + 1 : to)]). 

Case 2.2: e/ = (ffc,Ufe_|_i). Without loss of generality, P can be constructed 
according to the three cases, (i) i < k < j, (ii) i < j < k, and (iii) k < i < j. We 
only show the case that i<k<j,sGB, and t G W. Proofs of other cases are 
omitted. 

a) For k = 1, li X > z, then s G B and s G C'(l). 

P = (s^ vl,H[vlv-_,\C{2 : n) n i?(l : x)], H[v]+lt\R{x + 1 : to)]) 

If a; < z, 

r (s ^ ^ v^,H[vr\vr"\R{l : x - 1)], 

_ I iL [wf , t\C{3 : n) n R{x : to)]) if j = n 

" I (s^vr,v:^^v-,H[v--\v:zl\R{l-.x-l)], 

[ iL[w^_i, ^1(7(2 : n — 1) n i?(a: : to)]) if j yf n 

b) For 1 < fc < n — 1, 

f {H[s,v;;;\C{l : k)],H[v;^^^,t\C{k+l : n)]) if j = n 

1 : k)],H[v^,t\C{k + l : n)]) if j yf n 

Case 3: s,t G R{m). 

Case 3.1: e/ = By Lemma 5, there exists a hamiltonian path P' 

joining s and t in G {R{z + 1 : to)). We choose an edge {u, v) in G {R{z + 1)) 
such that P' includes it and neither u nor v is Let u' and v' be the 

vertices in R{z) adjacent to u and v, respectively. In G (i?(l : z)), there exists a 
hamiltonian path P" joining u' and v' by Lemma 5. By a merge of P' and P" 
w.r.t. {u,u') and (v,v'), we have a hamiltonian path. 

Case 3.2: e/ = Without loss of generality, we assume that i < j. 

Case 3.2.1: i < j <k. When s G B and t G W, 

' (iL[s,u]"lG(l : j - 1) n i?(2 : m)], H[v^ ,v^\C{k + I : n)], 
v\ v^_i,H[v^,t\C{j : A:)]) if to is even 

P=l (iL[s,z;2_ilG(l:j-l)ni?(2:TO)], 

H[v],v]\C{j : A:) n i?(l : TO - l)],fj_i ^ v\, 

iL[u)j,u^JG(A: + 1 : n)],w™ ^ A) if to is odd 

When s GW and t G B, 

' (s ^ vY',H[v^,vl^\G{k+l : n)], 

H[vlv'^-^\G{1 : fc)ni?(l:TO-2)],<-i 
p A|C'(* + 1 : A:) n R{m — 1 : to)]) if m is even 

^ = {H[s,v]!_-,^\C{1 : J - 1) n i?(TO - 1 : m)],v]'-^ ^ 

vl\G{l : fc) n i?(l : TO - 2)], 

^ H[vl,v™_^-^^\C{k + 1 : n)],vJIZ ^ t) if to is odd 

Case 3.2.2: i < k < j. We can construct a hamiltonian path P as follows. When 
TO is even, let s' = uj" and t' = u™ if s G B; s' = v\ and A' = if s G W. 
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When m is odd and s G B, let s' = and t' = if k is even; s' = v\ 

and t' = v\ ii k is odd. P = s'|C(l : k)], H[t' ,t\C{k + 1 : n)]). When m 
is odd and s G W, P = {H[s,v^\C{i : k)],v^^-^ ^ v^, H[vl,vl\C{l : 

i - l)],H[v^,t\C{k + 2 : n) n P(2 : m)]). □ 

Theorem 3. For m > 2,n > 4 even, M 2 {m,n) is 1-fault strongly hamiltonian 
laceahle. 

Proof. By Lemma 8, M2(m, n) with a single faulty vertex is strongly hamiltonian 
laceable. By Lemma 9, M 2 (m, n) with a single faulty edge has a hamiltonian path 
between any two vertices with different colors. It remains to show that there is 
an L°P*-path (of length mn — 2) joining every pair of vertices s and t with the 
same color. Let {u, v) be the faulty edge. Without loss of generality, we assume 
that u G B and v G W. When s and t are black, we can find an L°P*-path P 
between s and t regarding z; as a faulty vertex by using Lemma 8. P does not 
pass through {u,v) as well as v, and the length of P is mn — 2. Thus, P is a 
desired P°P*-path. In a similar way, we can construct an P°P*-path for a pair of 
white vertices. □ 

4 Fault Hamiltonicity of Hypercubes 

An n-dimensional hypercube, denoted by Q„, consists of 2” vertices that can be 
represented in the form of binary strings, bnbn-i---b\. Two vertices are adjacent 
if and only if their labels differ in exactly one bit. An edge is referred to as an i- 
dimension edge if the vertices it connects differ in bit position i. A /c-dimensional 
subcube in Qn is represented by a string of n symbols over set {0, 1, *}, where * 
is a don’t care symbol, such that there are exactly k *’s in the string. 

Lemma 10. For /e < n — 3 and l<r<n — /e — 2, with fe faulty edges 
has M2(2’’, 2”“’’) as a spanning subgraph. 

Proof Let P = {1, 2, ..., n} be the set of dimensions in Qn, Df = {/i, /2, fi} 
be the set of dimensions which contain faulty edges, and Dg = D — Df = 
{si, S2, Sj}- Since /e < n — 3, we have |Ps| > 3. 

If we replace 5^^ , 6^2 , ..., bs,. bits of each vertex label in Qn by then can 
be partitioned into 2”“’’ r-subcubes and theses subcubes form Qn-r by replacing 
each subcube as a vertex. We denote such a graph by a condensation graph Qn-r 
of Qn- Let u, V be the vertices in Qn, and C„, Cy be the components containing 
u and V, respectively. We assume that an edge (C„,C„) of Qn_r is faulty if 
(u,v) is faulty. For all n > 2, Qn with n — 2 faulty edges has a hamiltonian 
cycle[ll]. Since fe<n — r — 2, there exists a hamiltonian cycle in Qn-r- Let 
C = {xi,X 2 , ---, Xd, ---, X2"-r, xi) be a hamiltonian cycle in Qn-rt where 1 < d < 
2”“’’. Each vertex of Qn can be mapped to of M2{m,n) as follows: 

In bnbn-i---b\ of each vertex label of Qn, 

(i) bs^bs 2 ---bsy is the fc-th sequence of r-bit Gray code, and 

(ii) (n — r)-bits (except bs^ , bg^, ---, bg^ bits) represent the label of the d-th 
vertex in C . 

Thus, Qn has fault-free M2{2'' , 2”“’’) as a spanning subgraph. □ 
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By applying fault-hamiltonicity of M2{m,n) to a hypercube, we have the 
following theorem. 

Theorem 4. with f < n — 2 and fv<lis strongly hamiltonian laceable. 

5 Conclusion 

In this paper, we considered the fault hamiltonian properties of m x n meshes 
with two wraparound edges in the first row and the last row. We showed that 
M2{rn,n) with odd n is hamiltonian-connected and 1 -fault hamiltonian. 
M2{rn,n) has these hamiltonian properties by adding minimum edges to an 
mx n mesh. For n > 4 even, we showed that M2 (to, n) is 1 -fault strongly hamil- 
tonian laceable. By applying fault hamiltonicity of M2 (to, n) to a hypercube, we 
obtained that Q„ with at most n — 2 faulty elements and at most one faulty ver- 
tex is strongly hamiltonian laceable. Also, our results can be applied to the fault 
hamiltonian properties of other interconnection networks which has M2 (to, n) as 
a spanning subgraph. 
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Abstract. In this article we investigate the expected time for Herman’s 
probabilistic self-stabilizing algorithm in distributed systems: Suppose 
that the number of identical processes in a ring, say n, is odd and n > 
3. If the initial conhguration of the ring is not “legitimate”, that is, 
the number of tokens differs from one, then execution of the algorithm 
made up of synchronous probabilistic procedures with a parameter 0 < 
r < 1 results in convergence to a legitimate configuration with a unique 
token (Herman’s algorithm). We then show that the expected time of the 

2 o o 

convergence is less than . Moreover there exists a configuration 

whose expected time is 0{n^). The method of the proof is based on the 
analysis of coalescing random walks. 

Keywords: probabilistic self-stabilization, Herman’s algorithm, random 
walk, coalescing time. 



1 Introduction 

Distributed systems based on self-stabilizing algorithms were introduced by Di- 
jkstra [5]. These propose an elegant way of solving the problem of fault-tolerance 
(see also [6, 11]). In general, self-stabilizing algorithms have that arbitrary con- 
figuration reaches “legitimate” configuration satisfying desired property in finite 
times. Moreover probabilistic methods are often studied as solutions to problems 
for distributed computing in order to improve the efficiency of deterministic al- 
gorithms (see [6]). One of the useful probabilistic self-stabilizing algorithms was 
proposed by Herman [8]. The basic setting is the following: Suppose that the 
number of identical processes in a ring, say n, is odd and n > 3. If the initial 
configuration of the ring is not legitimate, that is, the number of tokens differs 
from one, then execution of the algorithm made up of synchronous probabilistic 
procedures with a parameter 0 < r < 1 results in convergence to a legitimate 
configuration with a unique token (Herman’s algorithm). 

For the algorithm we then show that the expected time of the convergence is 
less than Moreover we also give a configuration whose expected time 
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K.-Y. Chwa and J.I. Munro (Eds.): COCOON 2004, LNCS 3106, pp. 422—431, 2004. 
© Springer- Verlag Berlin Heidelberg 2004 




On the Expected Time for Herman’s Probabilistic Self-stabilizing Algorithm 423 



is 6>(n^). The method of the proof is based on the analysis of the expected time 
of coalescing random walks. 

A model of coalescing random walks was proposed by Aldous and Fill [2, 
Chapter 14] . They formalized them using continuized chains with 1-exponential 
distributions. Note that many quantities are unchanged by the passage from 
the discrete time chain to the continuized chain. Their setting is the following: 
Fix a connected graph G with order n. Consider independent random walks 
on each vertex. When some of them meet, they coalesce into a new random 
walk. What is the expected time that all walks coalesce into a single random 
walk? They showed that the time is less than e(logn + 2)Tg, where Tq is the 
maximal (in the meaning of start and goal points) expected hitting time of G. 
Brightwell and Winkler [3] showed that Tq is bounded by (4/27-|-o(l))n^, which 
is attained by some lollipop graphs. Moreover Coppersmith et al. [4], Tetali and 
Winkler [13] studied the meeting time, which is the expected number of steps 
that two independent random walks collide on a connected graph. They proved 
that Mg < {4:j27 + o(l))n^, where Mg is the maximal (in the meaning of 
start points) meeting time on a connected graph G with order n. Furthermore, 
Hassin and Peleg [9] applied the results of meeting times to an analysis of the 
expected absorption time in distributed probabilistic polling. In their setting 
they pointed out that the time is equivalent to the expected time of coalescing 
random walks. They showed that the time is 0{Mg log n). Their key idea of the 
estimate is the following: Since Mg is the maximal meeting time, at least n/2 
random walks have to meet after the time Mg ■ By repetition the results can be 
obtained. Recently Adler et al. [1] showed good estimates of expected coalescing 
times for iid random variables without considering graph structures. They noted 
applications to population biology. 

For expected times of Herman’s self-stabilizing algorithm we also apply the 
method of coalescing random walks. We regard tokens in the ring as independent 
random walks. Though in our case random walks do not coalesce but disappear 
when two random walks collide, we can see that the expected times do not change 
essentially. Noting that both Mn-ring and Tn-ring are 6>(n^), we can drop logn 
term for Aldous & Fill’s/Hassin & Peleg’s results. Moreover Aldous and Fill [2, 
Chapter 14 §3] conjectured that there exists an absolute constant K such that on 
any graph G the expected time of coalescing random walks is bounded by KTg- 
Hence our estimate would be one of the positive example for their conjecture. 
To estimate the expected time of coalescing random walks, we are aiming at not 
the maximal meeting time but the minimal meeting time on the ring. 

The plan of the paper as follows: After some preliminary about Herman’s 
algorithm (Section 2) , we state the estimate of the expected time of Herman’s 
algorithm in Section 3.1. The proof is given in Section 3.2. Moreover showing 
future works, we conclude in Section 4. The rough scenario of estimates in Section 
3.2 is the following: 

(i) We formalize tokens in the ring as independent random walks (Lemma 2). 

(ii) Noting the minimal meeting time of random walks of Item (i), we investi- 
gate correspondent one dimensional random walks (Lemma 3). 
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(iii) To estimate hitting times of one dimensional random walks of Item (ii), we 
use the coupling random walk whose expected time can be gotten by some 
difference equations (Lemmas 4 and 5). 

(iv) Using the hitting time of Item (iii), we estimate the desirable expected 
time. Moreover we give a simple example satisfying Eqn. (9). 



2 Herman’s Algorithm 



Let L = {0, • • • , n — 1} ~ 7i/n7i be a process ring, where n is an integer satisfying 
n > 3. Henceforth any index is defined for any integer subscript with modulo 
n. The state of each process is a single bit in {0, 1}, so that we denote by 
X = {0,1}" the configuration space whose element is represented by a vector 
of n bits X = (xo, • • • , Xn-i) G X. For a given 0 < r < 1 a random procedure 
/ : T — > T is defined by 



!W> = fr{x) = y= {yo,-“ ,yn-i) & X, yz = 



Xi—\ if Xi yf Xi—\ 
7i if Xi = Xi-1, 



( 1 ) 



where (z = 1, • • • , n) are independent random variables with each distribution 



Pr( 7 i = Xi) = 1- Pr( 7 i = I - Xi) = r for z = 0, • • • , n - 1. (2) 



For a given x = (xq, • • • , Xn-i) G X, we define a local state to be any consecutive 
pair {xi-i,Xi): 

— ii Xi = Xi-i, then there is a token at process z. (We will also say that Xi is a 
token.) 

— ii Xi ^ Xi-i, then there is a shift at process i. (We will also say that Xi is a 
shift.) 

Note that if n is odd, the number of tokens is also odd, that is. 



||a;|| mod 2 = n mod 2 for a; G X, (3) 

where || • || is the number of tokens in the ring. For each x = (xg, • • • , a;„_i) G X 
define the set of tokens/shifts as follows: 

Lt = Lt{x) = {\ < i < n ■. Xi = Xi-i}, (4) 

Ls = Ls{x) = L \ Lt = {i < i < n : Xi Xi-i}. 

Moreover we also define X^ = {x & X : \Lt{x)\ = k} = {x G T : ||a;|| = k}. 
We call the element of X^ a legitimate configuration. Use notation f*{x) = 
f{f*~^{x)) for t > 1, where f^{x) = x. Herman proved the following theorem: 

Theorem 1 (Herman). For the random procedure f defined by Eqn. (1), the 
following statements hold: 

(i) The number of tokens does not increase: 



Pr(||a:||>||/(x)||) = l. 



( 5 ) 
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(a) If there exists a unique token ati, then the token is eventually ati + \: For 
each X € with Xi = Xi-i 

Pr(3/c > 0 s.t. y = f’"{x) g y* = y*+i) = 1. (6) 

(Hi) If there are more than two or more tokens, the number of tokens is even- 
tually strictly decreasing: 

Pr(3fc > 0 sT. ||/'’'(a;)|| < ||a;||) = 1 for\\x\\>2. (7) 

Remark 1. (i) Suppose that n is odd. Then by Eqn. (3), the number of tokens 

is also odd. By Eqns. (5) and (7) we see that the number of decreasing 
tokens is even (including 0), and at last there will hereafter become a 
unique token. 

(ii) By Eqn. (6), the algorithm progresses in a legitimate configuration, that 
is, the token circulates in the ring. 

3 Expected Times for Herman’s Algorithm 

3.1 Main Results 

Fix an odd integer n > 3. Then by Item (i) in Remark 1, we see that for x € X 
there exists t = t{x) < oo with probability 1 such that 

Ikll > ||/(x)|| > ■ ■ • > \\f-\x)\\ > \\f{x)\\ = =■■■ = !■ 

Now we analyze the above random time t for any x € X. 

Theorem 2. Fix an odd integer n > 3. Then for any parameter 0 < r < 1 
of Eqn. (2), the expected time to become a legitimate configuration is less than 



maxE [infjt > 0 : f*{x) g Af^}] < 


TT^ — 8 2 


(8) 


8r(l — r) 


Moreover there exists x € X such that 






E[inf{t > 0 : f*{x) g X^}] = 


0{n^). 


(9) 



3.2 Proof of Theorem 2 

For a fixed 3 < k < n, we consider a configuration with k tokens g Then 
we define the first decreasing time for g Af^: 

T{x^) = inf{t > 0 : f{x^) ^ X^}. (10) 

To investigate the time, we define the minimum token distance: For x^ g X^, 
remembering Eqn. (4), 

D{x'^) = I 



min{(i(f, j) : i ^ j ^ Lt{x'^)}, if fc > 2 
0, if fc = 1 ’ 
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where d{i,j) denotes the minimum distance between i and j, that is, 

d{hj) = min{(z — j) mod n, (j — i) mod n}. (11) 

Then it is easy to see that 

D{x^) < [n/k\ for . (12) 

In fact if D{x^) > \n/k\ then any distance of two tokens is greater than n/k 
then the sum of these distance is greater than n. Now the repetition of a ran- 
dom procedure / implies a stochastic chain D{p{x^)) for t = 0,1,2,--- on 
{0, 1, • • • , \ n/k\}. Lemma 3 will claim the statistical property of D{f{x^)). Defin- 
ing 

Tfc = max E[T(a:^)], (13) 

x>‘GX>‘ 

we investigate the expected time that configuration will be legitimate, which 
is defined by 

Tfc = max E [inf{t > 0 : f*{x^) € X^}] = max E [inf{t > 0 : = 0}] . 

Noting Item (i) in Remark 1, we see that Tfc can be bounded by as 

follows: 

Lemma 1. For an odd integer n> 3, assume that there exist 3 < k < n tokens 
in an n-ring. Then we have 



(fc-l)/2 

Tfe< ^ T2/+1. (14) 

/=! 

Proof. Since n is odd, k is also odd by Theorem 1. Now we suppose that the num- 
ber of tokens will be fci at the first decreasing time T{x^). Putting k = kg, simi- 
larly suppose that the number of ki tokens will be at T{x^') for i = 0, ■ ■ ■ ,1, 
and = 1. Note that k = ko > ki > ■ ■ ■ > ki > ki+i = 1 are well-defined by 
Item (i) in Remark 1. It is clear that ki depends on x^ for i = 1, - ■ ■ ,1. By the 

definition of Tfc, we have Tfc = max^.fcg^^'^ E [T{x’^°) + T{x^^) + h T(a:^‘)] . 

Since ki is odd for z = 0, 1, • • • , ^ and ki > 3, we obtain 

Tfc < max E[T(a;^)] -I- max E[T(a;^“^)] -I- • • • -I- max E[T(a;^)] 

x^ex^ 

= Xk + Tfc_2 -l- • • • -l- T3. □ 

Computing local probability of the position of each token, we will estimate 
Ti for i = 3,5, • • • , k. The movement of each token in the ring is characterized by 
the following random walks: 

Lemma 2. For an odd integer n>3, assume that there exist 3 < k < n tokens 
in an n-ring. Putting each position of k tokens of x^ as 

Sq, ■ ■ ■ ,Sq G Lt{x^) sequentially. 



(15) 
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consider independent k random walks Sl, ■ ■ ■ , defined by Eqn. (15) and 
PfiSl = i + =i) = l- PfiSl = =i) = r 

(16) 

where t* is the first coalescing time of that is, 

t* = t*{x^) = inf{i > 0 : l<^i<^j<k s.t. S) = S( }. (17) 

Then the process of the position of tokens is equivalent to the process of S), - , Sf 

for 0 < t < t* — 1. Moreover the distribution for t* is equivalent to the one of 
T{x^) defined by Eqn. (10). 

Proof. For x^ = x = (xq, ■ ■ ■ , x„-i) € , we will show this lemma with either 

D{x) > 2 or D{x) = 1. 

Assume that D{x) > 2, that is, if z G Lt{x) then z — 1, z -b 1 G Ls{x). By the 
definition of the shift/token, we have that Xi-i = Xi, Xi -2 Xi-\ and Xi Xi+\. 
Therefore we obtain that 

Pr({f G Ls{f{x))} n {z -b 1 G LT{f{x))}) 

= 1 - Pr({z G Lrifix))} n {z -b 1 G Ls{f{x))}) = r 
by Eqns. (1) and (2). Hence we see that 

Pr(ll/(a;)|| = ||a;||) = 1 for Zl(a;) > 2, (18) 

and the local transition probability of the position of the token is equivalent to 
Eqn (16) for D{x) > 2. 

Next suppose that D{x) = 1, that is, there exists 2 < m < k satisfying that 
i,i + 1, ■ ■ ■ ,i + m — 1 G Lt{x), i — l,i + m G Ls{x). Then we see that Xi-\ = 
Xi = Xi+i = ... = and Xi -2 yf Xi-i,Xi+rn-i Xi+rn- Putting y = 

{yo, ■ ■ ■ ,z/n-i) = f{x), we have that pi-i yf yi+m- On the other hand, assume 
that continual m particles at time t—1, that is, S(_i = i, Sf_i = z-b 1, • • • , Sffi = 
z -b m — 1, so that we regard these particles as tokens. At the next step, consider 
that there is a token (resp. shift) if the number of particles of on 

each position z < j < z -b m is one (resp. zero or two). Noting that pj is a token 
if Pj -I = pj for j = i, - ■ ■ ,i + m, we have that the distribution of process for 
continual m are equivalent to the distribution of S), - , S'™ for 0 < t < — 1. 

It is clear that the equivalence of the law of T{x) and the first collision time 
t*{x). □ 

By Lemma 2 we see that the process of each token is simulated by the ran- 
dom walk defined by Eqn. (16). We will give some estimate of the transition 
probability for D{f{x)) as the following lemma: 

Lemma 3. For an odd integer rz > 3, assume that there exist 3 < k < n tokens 
in an n-ring. Then the following statements hold: 

(i) If a configuration x G satisfies D{x) = 1 then there exist Px, qx such 
that Px ^ — r) < qx < f — Px and 

Pr {{D{f{x)) = 2} n {||/(a;)|| = k}) = Px, Pr(||/(a:)|| < k) = qx, 
Pr{{D{f{x)) = 1} n {||/(a;)|| = k}) = 1 - Px - qx- 




428 



Toshio Nakata 



(ii) If a configuration x G satisfies D{x) > 2, then f{x) G X’^ . 

(i) If ^ ^ \x!^\ ~ 1 then there exist Px, Qx such that Px < 

r{l - r) < Qx < 1 - Px and 

Pr{D{f{x)) = D{x) + 1) = Px, Pr{D{f{x)) = D{x) - 1) = qx, 
Pr{D{f{x)) = D{x)) = 1 - Px - Qx- 

(ii) If D{x) = \ n/k\ > 2 then there exists r(l — r) < qx < ^ such that 

Pr{D{f{x)) = [n/k\ - 1) = qx, Pr(D(/(a;)) = [n/k\) = 1 - 
The rough sketch of estimating transition probabilities in Lemma 3 is in Fig. 1. 



1-pi-qi l-pi-qi l-qin/ki 




0 1 2 i-li i + l ^n/k\ - 1 



Fig. 1. Illustration of the transition probabilities of D{fi{x)) in Lemma 3. Each suffix 
of the axis denotes not /*(*) G X but D{fi{x)) G {0, 1, • • • , [n/fcj }. Therefore for 
example the probability pi presents Px in Lemma 3 for some D(x) = i. Hence we see 
that 0 < Pi < r(l — r) < 5i < 1 for i = 1, • • • , [n/fej. 




Proof of Lemma 3. Fix x G X^ . By Lemma 2, we can regard the process of 
k tokens as k independent random walks defined by Eqns. (15) and (16). For 
convenience, we define = Lft™{x) = {1 G Lt{x) : i,i + D{x) G Lt{x)}, 

which is not empty. Noting that Sq, - ■ ■ ,Sq are defined by Eqn. (15), we have 
that 

Pr{d{S(, = D{x)) = 1 for any f G (19) 

By Eqn. (16), we see that 

Pr (d{Si,Sl+^) = D{x) - l) = r(l - r), Pr [d{Sl Sl+^) = D{x)) = + (1 - rf, 

Pr (d{Si,Sl+^) = D{x) + 1) = r(l - r) for any i G L?!'". 

(20) 

(i) If D{x) = 1, then using Eqn. (20) we obtain that 

Px = Pr{{D{f{x))=2}f^{\\f{x)\\ = \\x\\}) 

<Pr( f| K^),5(+i) = 2} I <r(l-r). (21) 
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Moreover we also have that 

l>g,=Pr(||/(a:)||<fc) = Pr j \J {d{Sl S{+^) = 0} 

>r(l-r). (22) 

The first inequality of Eqn. (22) holds, because Pr(||/(a:)|| = k) > 
Pr{f{x) = x) > 0. Hence Item (i) has been proved. 

(ii) Since D{x) > 2, Eqn. (18) holds. Using Lemma 2, 

Pr{D{f{x)) = D{x)) + Pi{D{f{x)) = D{x) + 1) + Pr{D{f{x)) = D{x) - 1) = 1. 

(23) 

(i) 2 < D{x) < [n/k\ — 1: Then we can prove that 

Pr(D(/(a;)) = D{x) + 1) < r(l — r) < Pr(D(/(x)) = D{x) — 1) < 1, 

(24) 

by the same method of Eqns (21) and (22). Hence by Eqn. (23), Item 
(a) has been proved. 

(ii) D{x) = [n/k\: Using the same method of Eqn. (22), we see that 

Pr(D(/(a;)) = [n/k\ — 1) > r(l — r). 

Since D{x) < [n/k\ for x € X, we have that Pr(I?(/(a;)) = [n/k\ + 
1) = 0. Hence by Eqn. (23), Item (b) has been proved. □ 

Remark 2. There exist an n-ring and a configuration x & X satisfying that 
equality holds in Eqns (21), (22) and (24) respectively. 

We will compare D{p{x^)) to the following random walks: 

Lemma 4. Consider the following Markov chain Xt on state space {0, 1, • • • , m} 
with absorbing barrier 0.' For t > 0 and z = 1, • • • , m — 1 

Pr{Xt+i = i\Xt = i) = r"^ + {1 - r)^ , 

Pr{Xt+i = i + l|Xt = i) = Pr{Xt+i = i - l|Xi = i)= r(l - r) 
and for t > 0 

Pr{Xt+i = Q\Xt = 0) = 1, Pr(W+i = m\Xt = m) = 1 - r(l - r), 

Pr(Ali_|_i = m — l|Ali = m) = r(l — r). ' 

Then the expected hitting time of the absorbing barrier 0 is 

max Efinfit > 0 : At = OllAo = zl = Efinfit > 0 : At = OllAo = ml = V ■ 

i<i<m 2r(l — r) 

Proof. Putting Oj = E[inf{t > 0 : At = 0}|Ao = j], we have that 

Oj+i - 2oj + Oj-i = for j = 1, • • • , m - 1 

ao — 0, a^ri — a.jji—1 T 

Hence aj = aj-i + for j = 1, • • • , m. Noting oq = 0, sum up the equations, 

so that we obtain Om = - r ) • 
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To estimate Tk, we use the coupling chain Xt in Lemma 4 which is defined in 
the same probability space for D{f*-{x^)). 

Lemma 5. For an odd integer n > 3, assume that there exist k tokens for 

2 

3 < k < n in an n-ring. Then we have Tk < y^T^yyp'- 



Proof Putting m = [n/k\ in Lemma 4, we define the stochastic process Xt 
satisfying Xq = D{f^{x^)) and Eqns. (25), (26), which is defined on the state 
space {0, • • • , [n/k\ }. Then we see that D{f*{x^)) is stochastically smaller than 
Xt, that is, Pr(D(/‘(a:^)) < Xt) = 1 (see [12, Chapter IV]). Therefore putting 
Tx = infjt > 0 : Vt = 0}, since [n/k\ > 1, 



Tk < E[Tx\Xo = [n/k\] 



[n/fcj([n/fcj + 1) ^ 1 (ny 

2r(l — r) ~ r(l — r) \k) 



Proof of Theorem 2. By Eqn. (14) and Lemma 5 

yr:7)(35 + 55+ - +F) = 

In fact the last inequality of Eqn. (27) holds because it is known that 



,n. 

(27) 



^ > L 7T 

^^(2jTiF = T 

(see e.g. [10]). On the other hand, for a sufficiently large n, consider the to- 
ken X = (xo, • ■ ■ , ^n-i) G with xo,XYn/Q\,XYn/ 2 \ € Lt{x). Now set random 
walks Sl,Sf,Sf with initial positions Sq = xq, Sq= Sq = a;[„/2j, and 

transition probabilities satistying Eqn. (16) respectively. We see that D{x) = 
d(S'Q,S'g) = L’^/hJ. Moreover D{p{x)) = d{S},St) for t G {t : d{Sl,Sf) < 
[n/3]}. Until the time, we have 

Vr{D{f\x)) = D{f-^{x)) + 1) = Pr{D{f{x)) = D{f-^{x)) - 1) = r(l - r), 
Pr{D{f{x)) = D{f-\x))) = + (1 - r)^. 



Remembering T{x) as Eqn. (10), we have E[T(x)] > E[T*|Zl(a;) = [n/6j], where 
T* = infjt > 0 : D{f*{x)) G {0, [n/3]}}. Now putting bi = E[r*|H(a;) = i], {i = 
0, • • • , [n/3j), the following difference equation holds: bi = {r"^ + {1 — r)^}bi + 
r(l — r)bi+i + r(l — r)bi-i + 1 for i = 1, • • • , [n/3j — 1 with boundary conditions 
bo = ^L"/3J “ solution bi = z([n/3j — z)/{2r(l — r)} 

for i = 0, • • • , [n/3j. Therefore we see that bYn/o] ^ n^/{100r(l — r)}, so that 
T3 > E[r(a;)] > n^/{100r(l — r)}. Hence we have Eqn. (9). □ 

Remark 3. By the proof of Eqn. (9), we see that a few tokens with long distances 
make the expected time large. 
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4 Concluding Remarks 

We proved that the maximal expected time of Herman’s self-stabilizing algorithm 
is less than Especially if r = 1/2 it is bounded by 0.936n^. However 

we do not know that whether r = 1/2 is optimal, which is a future work. It would 
also be of future interest to apply our argument to general connected graphs. 

Moreover note that recently Fribourg et. al. [7] characterized Herman’s algo- 
rithm with Gibbs fields. We should study the self-statilization at the viewpoint 
of statistical physics. 
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Abstract. A square is a string of characters that can be divided into two 
identical substrings. The problem of detecting squares in a string finds 
applications in many areas such as bioinformatics and data compression. 

In this paper, we give the first efficient online algorithm for the problem. 

Given any input string, our algorithm reports the first square in the 
string using 0(n log^ n) time where n is the position in the string where 
the square ends. This time complexity is only a factor of O(logn) larger 
than that of an optimal offline algorithm. 

1 Introduction 

Given a string T = (T[l], T[2], • • • , T\2k]) of an even number of characters, we 
say that T is a square if we can divide T into two identical substrings, i.e., 
if {T[l],T[2],---,T[k]) = {T[k+ l],T[k + 2],---,T[2k]). We say that a string 
T = {T[1],T[2],- ■ ■ ,T[n\) contains a square if it has a substring {T[i],T[i + 
l],---,T[j]) which is a square. It is square free if it has no such substrings. 
Detecting squares has found applications in areas such as bioinformatics [5] and 
text compression [4] . There are many efficient offline algorithms for the problem 
[1-3, 8, 10]. However, this offline property is not desirable in many applications. 
For example, suppose we want to determine whether a string of one million 
characters is square free. Note that we can stop as soon as we detect the first 
square. However, if we apply the offline algorithms, we have to scan through 
all the one million characters even when the square appears at the first few 
characters of the string. In some applications, such as online data compression, 
this offline property is even unacceptable; we need to detect a square as soon as 
a new character arrives. 

Our work on constraints satisfaction gives us another motivation for studying 
online algorithms for square detection. In [6,7,11], we studied the local search 
method for solving the classical constraints satisfaction problem. This method 
is efficient in general. However, there is no guarantee its stop; the search can be 
trapped in some infinite loop. Interestingly, it can be proved that such a loop 
corresponds to a square in some string whose characters encode the states of 
the search. To avoid wasting time in an infinite loop, we need to detect a square 
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in the string online. In other words, every time the search makes a move and a 
new character is added to the string, we detect if there is any square in the new 
string. We stop if there is a square. 

In this paper, we give the first online algorithm for square detection. Given 
any input string T, our algorithm reads T character by character. After every 
new character is read, it checks immediately whether there is a square in the 
string. If the first square ends at the nth position of T, our algorithm stops after 
reading the first n characters and report the square. The time complexity of the 
whole process is O(nlog^n). In other words, after reading a character, it takes 
0(log^ n) amortized time for square detection. The optimal offline algorithm for 
the problem [3] has time complexity O(nlogn) for a general alphabet where n is 
the length of T. Note that if the size of the alphabet is constant, offline deciding 
whether a string is square free can be solved in 0{n) [9] time. 

To explain how our algorithm works, let us suppose that we have read a string 
T = (T[l], T[2], . . . , T[n]) of n characters and find that it is square free. We have 
to continue. A new character T[n+ 1] arrives. Although for each 1 < f < n, the 
string {T[i],T[i+l], ... ,T[n]) is not a square (otherwise we would have stopped) , 
it is possible that for some i, {T[i],T[i + 1], . . . , T[n],T[n + 1]) is a square. To 
check these n strings {T[i],T[i+ 1] . . . T[n + 1]), 1 < i < n, all over again takes 
too much time. To reduce the time complexity, our idea is to identify O(logn) 
regions in the string. We show that if any one of these regions has a square, 
it will have some structural property. By using this property, we show how to 
recover the square in 0(log n). Since a square can start at anyone of the 0(log n) 
regions, we have to check them all, and the total time for such a detection is 
0(log^ n). 

The paper is organized as follows. In Section 2, we give the definitions and 
notations necessary for our discussion. In Section 3, we give a framework of our 
algorithm. In Section 4, we give details of the main step and analyze its running 
time. Finally, we propose some future work in Section 5. 

2 Definitions and Notations 

In this section, we give formally the definitions and notations necessary for our 
discussion. 

The input of our problem is a string T = (T[1],T[2], T[3],...) of charac- 
ters. Given any integers 1 < a < b, let T[a..b] denote the string (T[a],T[a-|- 
1], . . . ,T[6]). A string S = T[a..b] is a square if 

— its length ^=h— a+1 is even, and 

- T[a..{a + £/2 - 1)] = T[{a + £/2)..b], i.e., T[a] = T[a + i/2],T[a -k 1] = 

T[a + £/2+l],..., T[a + ij2 - 1] = T[b]. 

We say that T[a..{a + i/2 — 1)] is the first half, and T[{a + £/2)..b] is the second 
half of S. Given another string T[i..j], we say that S is hanging in T[i..j] if the 
first half of S falls completely in T\i..j] and the second half of S does not fall 
completely in T[i..j\. More precisely, we say that S is hanging in T[i..j] if we 
have 
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t 

i < a < a^ 1 < j < b. 

For ease of referencing, we also say that S' is a hanging square in T[i..j], 

For any h > 1, a suffix of T[l..h] is the string T[i..h] for some 1 <i < h. We 
call T[i..h] is a square suffix of T[l..h] if it is also a square. For any two strings 
X and Y, let XY denote the concatenation of X and Y, i.e., the string obtained 
by appending Y at the end of X. Hence, a square is the string XX for some X. 

3 The Algorithm 

In this section, we describe an online algorithm for detecting squares. Our algo- 
rithm reads the input string T character by character. After reading every new 
character T[h], it checks immediately (i.e., without reading T[h + 1]) whether 
the string (T[1],T[2], . . . ,T[h]) has a square. If there is a square, it stops and 
reports the square. Note that if h is the smallest integer such that T\\..h] has a 
square, then this square must be a suffix of T[\..h]. Furthermore, T\l..{h — 1)] 
must be square free. Thus, we focus on detecting a square suffix of T[\..h], and 
our analysis will make use of the fact that T[\..{h — 1)] is square free. 

The core of our algorithm is the procedure DHangSq(z, j), which solves the 
following more restrictive problem: For every j<h<2j — i + 1, after the 
character T[h] is read, DHangSq(r[i..j]) checks whether T\l..h] has a square 
suffix hanging in T[i..j\. 

Note that for all h > 2j — i + 1, there is no square suffix of T\l..h] that 
can hang in T[i..j\. In Section 4, we describe the procedure DHangSq(z,j) and 
show that it can be finished in total 0{{j — i) log(j — t)) time. In the following, 
we explain how to make use of DHangSq to solve the online square detection 
problem. 

Our idea is to run enough instances of DHangSq(z, j) for different i, j such that 
if r[l..h] has a square suffix S, then there must be an instance DHangSq(f, j) run- 
ning where S' is a hanging square of T[i..j] . Then, when T[h] is read, DHangSq(t, j) 
will detect S. Here are the details. 

For any 1 < i < j, we say that the pair (i,j) is a level-^ pair if 

— the second integer j = k ■ 2^ for some k G {1,2,3, . . .}, and 

— the first integer i = max{ 1 , k2^ — 4 • 2^ -I- 1 } . 

Following is our online square detection algorithm Dsq, whose task is basically 
to start the right DHangSq(z, j) at the right time. 

For h = 1,2, 3, after reading T[h], do the following steps: 

if there is a square in T[(h — 3)../i], or any of the running DHangSq(i, jf) de- 
tects a square in T[l..h], then stops. 
j = h] i = 0; 
while {j >2^) do 

if = k2^ for some k) 

i = max{l, k2^ — 4 • 2^ + 1}; 

for the level-^ pair start DHangSq(i, ji); 





An Efficient Online Algorithm for Square Detection 435 



(Note that Dsq is not a parallel algorithm. After reading some character T\h], 
each DHangSq(t, j) will check in turn whether T[l..h] has a square suffix hanging 
in T[i..j\.) Now, we prove the correctness of Dsq. Obviously, Dsq correctly detects 
a square suffix XX if |A'A'| < 4. The following lemma considers the other case. 

Lemma 1. Suppose h is the smallest integer such that T[l..h] contains a square 
suffix XX . Furthermore, suppose that \XX\ > 4. Then, after T[h] is read, the 
square XX will be detected by one of the DHangSq(f,j) started by Dsq. 

Proof. Since \XX\ > 4, we have [A"! > 2. Suppose that 2^ < \X\ < 2^+^ for some 
£> 1. Note that there exists integer k such that k2^ < h < {k+ 1)2^. Let j = k2^ 
and i = k2^ — 4 • 2^ + 1. Hence, (i,j) is a level-^ pair, and by construction, when 
the character T[j] is read, Dsq will run DHangSq(i, j). Below, we verify that XX 
must be hanging in T[i..j], and thus it will be detected by DHangSq(i, j) (see 
Theorem 4) . 

Note that (i) since |AT| > 2^ and h < {k + 1)2^, we have h — \X\ < k2^ = j 
and thus the first half of the square ends at some position smaller than j, and 
(ii) we have z = j — 4-2^ + l < /i — 4 - 2^+1 < h — 2\X\ + 1 and thus the starting 
position of XX is not smaller than i. Therefore, XX hangs in T[i..j], □ 

Finally, we have the following theorem. 

Theorem 1. Suppose h is the smallest integer such that T[l..h] contains a 
square. Then, for every f < i < h, Dsq correctly determines whether T[l..i] 
has a square suffix immediately after reading T[i]. The total time complexity is 
0{hlog^ h). 

Proof. The correctness of Dsq is given in Lemma 1. To show the time bound, 
it suffices to show below that all instances DHangSq(z, j) created by Dsq can be 
finished in total 0(/i log^ h) time. 

Note that Dsq stops before reading the character Hence, it will not 

run DHangSq(z,j) for any level- [log /i] pair (z,j). For any 1 < £ < [log/z], Dsq 
runs DHangSq for 0{hf2^) level-t' pairs. In Section 4, we show that each of these 
instances can be solved in 0{£2^) time (Theorem 4), and thus all these level-^ 
instances can be solved in total 0{£2^{hf2^)) = 0{£h) time. Summing this total 
running time over all the 0(log h) different levels £, the theorem follows. □ 

4 Design and Analysis of DHangSq 

In this section, we describe how to implement DHangSq(z, j). First, we make the 
following observation. 

Fact 2 Suppose that the square T[k..h] hangs in T[i..j]. Then, there are strings 
X and G such that T[k..j] = XGX and T[{j + l)../z] = G. 

The above fact can be verified easily by an example. Note that the square 
r[5..10] = abcabc hangs in T[1..8] = aaaaabca, and T[5..8] = XGX and T[9..10] 
= G where X = a and G = be. Fact 2 motivates the following definition: We say 
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that a string is a pseudo-square with a center G if it is equal to XGX for some 
X. For example, the string abccab is a pseudo-square with a center cc. Followings 
are some simple facts about a pseudo-square. 

— A pseudo-square with an empty string as its center is a square. 

— A pseudo-square may have different centers. For example, the string aabbbbaa 
is a pseudo-square with a center bbbb, and is also a pseudo-square with a 
center abbbba. 

— Every string is a pseudo-square with the center equal to itself. 

Now, we replace Fact 2 in terms of pseudo-squares. 

Fact 3 Suppose i < j < h. The string T[l..h] has a square sujfix hanging in 
T[i--j] if and only if T[i..j] has a suffix that is a pseudo-square with a center 
equal to T[(j -|- l)../i]. 

The above fact suggests a simple way to implement DHangSq(t, j). After 
reading T[i],T[i -|- 1], . . . , T[j], we find out the centers of all pseudo-square suf- 
fixes of T[i..j] and put them in a list L. Then, after reading any T[h] where 
j < h < 2j — i-\-l, we conclude that T[l..h] has a square suffix hanging in T[i..j] 
if the string T[{j -\- l)../i] is in L. 

Note that L can have as many as 0{{j — z)^) entries. In the next section, we 
describe a much more compact list that is still sufficient. We also show how to 
find this list in 0{j — z) time. Then, in Section 4.2, we explain how to make use 
of this list to run DHangSq(z, j) in 0{{j — i) log(j — z)) time. 



4.1 Minimum Suffix- Centers 

In this section, we explain how to find a short list of centers for DHangSq(z, j). 
We say G is a suffix-center (for T[i..j]) at position s {i < s < j) if T[i..j] has a 
pseudo-square suffix T[k..j] = XGX where G starts at T[s]. If G is the shortest 
among all the suffix-centers at s, we say that G is the minimum suffix-center at 
s. For example, if T[3..10] = ccaabbaa, bb is a suffix-center at 7, and obviously, 
it is minimum. The following lemma shows why we are interested in minimum 
suffix-centers. 

Lemma 2. Suppose that h is the smallest integer such that T[i..h] contains a 
square suffix hanging in T[i..j]. Then, G = T[{j -\- l)../i] is a minimum suffix- 
center at some s € {z, z -I- 1, . . . ,j}. 

Proof. From Fact 3, we know that G is a suffix-center at some position s. To 
show that it is minimum, we assume to the contrary that another G' yf G is the 
minimum suffix-center at s. Note that both G and G' are substring of T[i..j] 
starting at s, and because of G' is minimum, we have |G'| < |G|. It follows that 
G' is a prefix of G. Together with the fact that T[{j -\- l)../z] = G, we conclude 
that T[{j -\- l)..h'] = G' for some h' < h. By Fact 3, T[i..h'] contains a square 
suffix hanging in T[i..j\. Hence, T[h] is not the first character arrival such that 
T[i..h] contains a square suffix hanging in T[z..j]. This is a contradiction. □ 
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DHangSq(z, j) is defined as follows: 



1. Construct the list L of all minimum suffix-centers for T[i..j\] 

2. For every h & {j + 1, j + 2, . . . ,2j — i + 1}, after reading T[h], 

check whether T[(j -|- is in L. If T[(j -|- is in L, 

then stop and report finding a hanging square. 



Below, we show how to construct L in 0{j — i) time (Lemma 4). In the 
next section, we show how to check, for each h, whether T[(j + I)../i] is in L in 
0(log(j — i)) time (Lemma 5). Altogether, we have the following theorem. 

Theorem 4. For every h G {j + l,j + 2, . . . ,2j — i + 1}, after reading T[h], 
DHangSq(z, j) correctly detects whether T[i..h] contains a square suffix that hangs 
in T[i..j]. The total time complexity is 0{{j — i) log(j — i)). 

Since there is at most one minimum suffix-center for each position h G {i,i + 
1, . . . , j}, the list L of all minimum suffix-centers for T[i..j] has size 0{j — i). 
Below, we show how to construct L in 0{j — i) time. Note that even though 
there are only 0{j — i) strings in L, many of them may have 0{j — i) characters. 
Thus, we cannot afford to store L as a list of strings. Instead, we will only store, 
for each string in L, a pair of indices (s, e) where s is the starting position, and 
e is the ending position of this string. 

The following lemma characterizes a minimum suffix-center. 

Lemma 3. Let G he the minimum suffix- center for T[i..j] at s. Suppose G is 
the center of the pseudo-square suffix T[k..j] = XGX . Then, X is the longest 
suffix ofT[i..{s — 1)] that is also equal to a suffix ofT[i..j], 

Proof. Since T[k..j] = XGX and G starts at s, we have T[k..{s — 1)] = X, 
r[s..(s-|- |G| — 1)] = G, and T[(s-|- |G|)..j] = X. Hence, T[k..{s — 1)] is a suffix of 
r[z..(s— 1)] that is equal to a suffix of T[i..j], namely T[(s-|- |G|)..j]. T[k..{s— 1)] 
is the longest such suffix because |G| is minimum. □ 

Thus, to find L, it suffices to find for all z < s < j, the longest suffix of 
T[i..{s — 1)] that is also a suffix of T[i..j], The following lemma shows that they 
can be found efficiently. 

Lemma 4. We can find for all i < k < j, the longest suffix of T[i..k] that is 
also a suffix ofT[i..j], Hence the list L of minimum suffix- centers o/T[z..j] can 
he constructed in 0{j — i) time. 

Proof. Note that there is an linear time algorithm to find, for all 1 < z < |P|, the 
longest prefix of P[i..n] that is also a prefix of P (see [5]). By reversing T[i..j] 
and applying this algorithm, we can find, for all z < s < j, the longest suffix of 
T[i..{s — 1)] that is equal to some suffix of T[i..j], and the lemma follows. □ 

4.2 Checking the List L of Minimum SufRx-Centers 

Recall that L is stored as a list of index pairs (s, e). In this section, we show 
how to determine, for every h G {j -I- 1, j -I- 2, . . . , 2j — z -|- I}, whether there is an 
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index pair (s, e) G L such that T[s..e] equals T[{j + l)..h]. Note that checking 
the list L in brute force might take quadratic time. However, we can speedup 
the process drastically if we throw away the index pair (s, e) G L as soon as we 
find that it cannot be equal to T[{j + l)..h] for any h. Here are the details. 

After the first character T\j + 1] arrives, we scan through L. If there is a 
pair (s,e) G L where s = e and T[s] = T[j + 1], we stop as we have found the 
first hanging square. Otherwise, we throw away all (s, e) with T[s] yf T[j + 1] 
because T[s..e] cannot be equal to T[{j + l)../i] for any h] their first characters 
are already different. Then, we repeat the process when the second character 
T[j + 2] is read. In general, when T[j + x] {j + x = h) is read, we can be sure 
that for all remaining (s,e) G L, the first a; — 1 characters of T[s..e] are equal 
to T[{j + l)..(j + X — 1)]. Thus if there is a (s, e) G L with e = s + a; — 1 and 
r[e] = T[j + x], then we have found the first hanging square. Otherwise, we can 
throw away all (s, e) with T[s + a; — 1] yf T[j + a;]. 

Note that when each character T[h] is read, and when we scan L, we only 
check one character for each remaining (s, e) G L. Thus, if we denote by Px the 
set of remaining index pairs in L when we start to process T[j + a;], the running 
time of this checking process is 



\Pl\ + \P2\ + ---\Pm\ (2) 

where T[j + m] {j + m=h),m<j — i + 1 is the last character before we stop. 
It is easy to see, as shown in the lemma below, that \Px\ < {j — i)/x for all 
1 < a; < m. 

Lemma 5. For any 1 < x < m, |iA| < (j — i)/x. Thus, |Pi| + IP 2 I + • • • \Pm\, 
the total time for checking L, is not more than (j — f)(l + 1/2 + • • • + 1/m) = 
0{{j-i) log(j-f)) 

Proof Note that for every pair (s,e) G Px, it has passed the first a; — 1 scans 
and thus the first x characters of T[s..e] must be equal to T[(j + l)../i]. It follows 
that for any two (s, e), (s', e') G Px (assume s < s'), the first x characters of 
r[s..e] and T[s'..e'] are equal, i.e., T[s..(s + a: — 1)] = T[s'..(s' + a: — 1], (both 
are equal to T[{j + l)../i]). It is important to note that T[s..{s + a: — 1)] and 
r[s'..(s' + a; — 1)] are disjoint; otherwise, we have s<s'<s + a; — l<s' + a;— 1 
and together with the fact that T[s..(s + a:— 1)] = T[s'..(s' + a;— 1)], we conclude 
T[s..(s' — 1)] = T[s'..(2s' — s — 1)] and T[s..(2s' — s — 1)] is a square; this is 
impossible because we would have stop earlier. Consequently, there can be at 
most (j — i)/{x — 1) index pairs in Px, or equivalently, \Px\ < (j — i)!{x — 1). 
The lemma follows. □ 

5 Future Work 

We have designed an efficient online algorithm for detecting a square. We find 
that the bottleneck of our algorithm is the checking of the list of minimum 
suffix-centers. Currently, our checking is rather straightforward, and we believe 
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there are some better checking procedure that can help reduce the overall time 
complexity. Another interesting problem is to general our algorithm to detect 
cube, or even higher order of repetitions, in a string. 
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Abstract. We consider the local alignment problem where sequences 
may have masked regions. The bases in masked regions are either un- 
specified or unknown, and they will be denoted by N. We present an ef- 
ficient algorithm that finds an optimal local alignment by skipping such 
masked regions of sequences. Our algorithm works for both the affine gap 
penalty model and the linear gap penalty model. The time complexity 
of our algorithm is 0{{n — T)(m — S') -|- vm -\- wn) time, where n and m 
are the lengths of given sequences a and b, T and S are the numbers of 
base N in o and b, and v and w are the numbers of masked regions of a 
and b, respectively. 



1 Introduction 

A local alignment algorithm finds substrings a and j3 of sequences a and b, 
respectively, where a is similar to /? [7]. A local alignment algorithm is used in 
many applications such as DNA sequencing programs. When the lengths of a 
and b are n and m, respectively. Smith and Waterman [12] gave a well-known 
0{n^m) time local alignment algorithm for the affine gap penalty model and 
Gotoh [5] improved it to 0{nm) time. Crochemore et al. [2] gave a subquadratic 
time algorithm but it considers only a linear gap penalty model. 

In DNA sequencing [1,6,8,9,13], some bases of fragments lose their infor- 
mation due to various reasons. RepeatMasker [11] screens DNA sequences for 
interspersed repeats and low complexity DNA sequences. Low-quality bases of 
DNA sequences are also screened by vector screening [1, 6, 9]. Some bases of frag- 
ments may not be determined by a base caller [3] . These unspecified or unknown 
bases will be denoted by N in this paper. Since masked regions lost their base 
information, alignments with masked regions are meaningless. 

In this paper we present an efficient algorithm that finds an optimal local 
alignment by skipping such masked regions of sequences. In spite of skipping 
masked regions, the local alignment that our algorithm finds is the same as the 
one that the Smith- Waterman-Gotoh algorithm finds by computing all entries. 

* This work was supported by the MOST grant Ml-0309-06-0003. 
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To skip masked regions, we first give a new recurrence for insignificant entries 
in a dynamic programming table, and then develop an efficient algorithm based 
on the new recurrence. Our algorithm runs in 0{{n — T){m — S) + vm + wn) 
time, where T and S are the numbers of base N in a and 6, and v and w are the 
numbers of masked regions of a and b, respectively. 

The remainder of the paper is organized as follows. In Section 2, we describe 
the local alignment algorithm due to Smith, Waterman, and Gotoh, and we 
extend it to include base N. In Section 3 we give a new recurrence for insignificant 
entries. In Section 4 we present a new local alignment algorithm for the affine 
gap penalty model that is based on the new recurrence. Section 5 gives the 
time complexity analysis of our algorithm. Section 6 briefly describes a simpler 
algorithm for the linear gap penalty model. 

2 Preliminaries 

We first give some definitions and notations that will be used in our algorithm. A 
string or a sequence is concatenations of zero or more characters from an alphabet 
S. A space is denoted hy A ^ S] we regard Z\ as a character for convenience. 
The length of a string a is denoted by |a|. Let at denote fth character of a string 
a and Uij denote a substring aiOi+i . . .aj of a. When a sequence a is a substring 
of a sequence a, we denote it by a A a. Given two strings a = ai02 . . . a„ and 
b = 6162 ■ • ■ bm, an alignment of a and b is a* = 0*03 ■ . ■ af and b* = 6*63 ■ ■ - bi 
constructed by inserting zero or more Z\s into a and b so that each a* maps to 
b* for 1 < i < 1. There are three kinds of mappings in a* and b* according to 
the characters of a* and b*. 

• match : a* = b* A, 

• mismatch : (a* yf b*) and {a*,b* yf A), 

• insertion or deletion (indel for short) : either a* or b* is A. 

Note that we do not allow the case of a* = b* = A. 

2.1 Local Alignments 

Given two sequences a and b, a local alignment of a and b is an alignment of two 
strings a and /? such a ^ a and /3 A 6, and an optimal local alignment of a and 
6 is a local alignment of a and b that has the highest similarity among all local 
alignments of a and b. We denote the similarity of an optimal local alignment 
by SL{a,b). 

A well-known algorithm to find an optimal local alignment was given by 
Smith and Waterman [12], and Gotoh [5], which will be called the Smith- 
Waterman-Gotoh algorithm (SWG algorithm for short) [12,14,5]. Given two 
sequences a and b where |a| = n and |5| = m, the SWG algorithm com- 
putes SL{a,b) using a dynamic programming table (called the H table) of size 
(n-l- l)(m-|- 1). Let Hij for 0 < z < n and 0 < j < m denote SL{ai^i, bij). Then, 
Hij can be computed by the following recurrence: 
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A C G T N 



A 1-2-2-2 0 

C -2 1 -2 -2 0 

G -2 -2 1-2 0 

T -2 -2 -2 1 0 

N 0 0 0 0 0 



Fig. 1. Scoring matrix for DNA sequence where 5 = 2 and cr = 0. 



Hij = + s(ai, bj), Cij, Rij, 0} for 1 < i < n, 1 < j < m 

where Cij = maxl<fc<^{i^i_feJ -gfc} and R^j = Ina,Xl<k<j{H^,j-k-gk}■ s{ai,bj) 
is a similarity between elements and bj such that 



and Qk is the gap penalty for an indel of fc > 1 bases such that gk = "f + kg 
where <5, 7, and g are non-negative constants. Among all Hij for 0 < i < n and 
0 < j < m, the highest value is SL(a, b), and the SWG algorithm takes 0{nm) 
time to compute it [5]. 

2.2 Scoring Matrix 

In case of DNA sequences, there are four bases, A, C, G, and T. In addition, 
lUB ambiguity codes [10] shows several other bases which are for incompletely 
specified bases. In practice, however, bases N and X are only used. N means 
‘unspecified’ and X means ‘unknown’. These bases are used when the original 
bases are screened by repeat masking or vector screening, or when the original 
bases are not determined by a base caller. In this paper we will use only N for 
an unspecified or unknown base. 

We extend the definition of similarity s(ai,bj). Because N might be any 
base, it is impossible to determine whether an alignment with N is a match or a 
mismatch. Hence, similarity s{ai,bj) is newly defined such that 



where cr is a constant (which may not be 0) [4,6]. Figure 1 shows an example 
of the scoring matrix for DNA sequence alignments where S = 2 and cr = 0. We 
call an entry Hij where a, = N or bj = N an insignificant entry. 

2.3 Gap Penalty Models [7] 

We defined the gap penalty gk StS gk = j + kg where 7 and g are non-negative 
constants. This is called the affine gap penalty model, where 7 is the gap initiation 
penalty and g is the gap extension penalty. 

When there is no gap initiation penalty, i.e., gk = kg, we call it the linear 
gap penalty model. 
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3 New Recurrence for an Insignificant Entry 

In this section we present a new recurrence for an insignificant entry Hij. We 
begin with some lemmas that are needed to get the new recurrence. 

We define C^j = Rio = — oo for 1 < f < m and 1 < j < n and define go = 0. 
Let a and b be two DNA sequences where |a| = n and |6| = m. Suppose that 
Qi-t+ 1,1 is a masked region of length t, i.e., ai-t+i,i = NN. . . N. 

Lemma 1. Hij > Hi-yj-x + za — gx-z — 9y-z for some constants x, y and z 
such that 0 < X < j, 0 < y <t, and 0 < z < min{x, y}. 

The above lemma provides a relation between Hij and its previous defined en- 
tries. If equality holds in Lemma 1, we say that Hij comes from Hi-y^-x- In 
other words, an optimal local alignment which has a mapping between and 
bj also has a mapping between Ui-y and bj-x- 

We will use ‘come from’ for some more cases. If = Hi-k,j — 9k and Hi-k,j 
comes from Hi-yj-x, we say that Cij comes from Hi-y^-x- If Ci-yj-x = 
Hi—y—kj—x 9k and Hij = Hi—y—f^ j—x T (y -t- /c t)(J gt 9x—y—k+t for t ^ 
we say that Hij comes from Ci-yj-x- Similarly, we define that Cij comes from 
Ci — y^j — X- 

The following lemmas provide the range of entries which Hij can come from. 

Lemma 2. If Hij comes from Hi-tj-u for u > 0, there exist at least one 
Hi-t,j-s which Hij comes from for some constant s such that 0 < s <t. 

Lemma 3. If Hij comes from Ci-tj-u for u > 0, there exist at least one 
Ci-tj-s which Hij comes from for some constant s such that 0 < s < t — 1. 

The following lemmas provide the range of entries which Cij can come from. 

Lemma 4. If Cij comes from Hi-t,j-u for u > 0, there exist at least one 
Hi-t,j-s which Cij comes from for some constant s such that 0 < s < t — 1. 

Lemma 5. If C\j comes from Ci-tj-u for u > 1, there exist at least one 
Hi_tj-s which Cij comes from for some constant s such that 0 < s < t — 1. 

The following lemmas handle the case Hij can come from the leftmost column. 
(For Cij, results similar to Lemmas 6 and 7 hold.) 

Lemma 6. If t < j <m and Hij comes from Hi-u,o for 0 < u <t, Hij comes 
from Hi-t,j-s for 0 < s < t. 

Lemma 7. If 0 < j < t and Hij comes from Hi-u,o for 0 < u < t, Hij comes 
from iLj- • 

By above lemmas, we derive the following theorem which provides a new recur- 
rence for an insignificant entry Hij. 
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Theorem 1. Let be a masked region of length t. Then Hij can he com- 

puted by the following recurrence: 



max • 



= 



max « 



max {Hi^t,j-t-\-k + (t - k)a - j - knjf 
ja-j- II, C^-tj - til ^ 

max^{i7j_tj_t+fe + (i - k)(7 - 7 - kii}A 

Ci—tJ til J 



for I < j <t 
for t < j < m. 



max • 



H.j = { 



max • 



max {Ci-tj-ti-k + (t- k)a - kii}f 

jcr, Cy-.O , 

max {Ci-tj-t-ik + (t - k)a - kii},^ 

X k i — X L 

, J- tc, C^j , 0 J 



for 1 < j <t 
for t < j < m. 



Suppose that bi-t+i^i is a masked region of length t. The above lemmas and 
Theorem 1 hold in the symmetric case. In other words, they hold if we replace 
by Hji and C^- by Rji. 



4 New Algorithm for AfRne Gap Penalty Model 

A straightforward method to find an optimal local alignment of masked se- 
quences a and b where |o| = n and |6| = m is to run the SWG algorithm in 
0{nm) time. We want to find the local alignment without computing insignifi- 
cant entries. 

We present an algorithm for the affine gap penalty model. Our algorithm 
consists of the following four cases: 

Case 1: Neither Ui nor bj is N. Compute all these entries by the SWG algo- 
rithm. 

Case 2: Only ai is N. Compute the bottom row of a block of insignificant 
entries. 

Case 3: Only bj is N. Compute the rightmost column of a block of insignificant 
entries. 

Case 4: Both at and bj are N. Compute the bottom-right corner entry of a 
block of insignificant entries. In addition, compute a column to the left 
of the block and a row on top of the block. See Figure 2. 

Figure 2 shows an example of four cases in the H table. We explain our algorithm 
for Case 2 and Case 4, since Case 3 is symmetric to Case 2. 



4.1 Case 2 



Let Oi-t+ij be a masked region of length t, i.e., ai-t+i^i = NN. . . N. We will 
show how to compute the bottom row ^ of a t x g block in 0{q) time. 

To do this, we first find the relation between two adjacent insignificant entries. 
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case 4 
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. ! m 





Fig. 2. An example of four cases in the H table. In Case 1, we compute all the entries. 
In Case 2, we compute only row E\. We compute only column E 2 in Case 3. In Case 
4, we compute an entry E 3 and compute row E 4 and column E 5 . 



Then, we define a new data structure, MQUEUE, and give an algorithm of 0{q) 
time. 

Computing the bottom row without computing other insignificant entries is 
possible by using the recurrences for an insignificant entry in Theorem 1. But it 
still takes 0{tq) time for a t x g block because it needs 0{t) computations for 
each Cij and Hij. However, we can reduce the time complexity to 0{q) using 
some properties of the recurrences. 

Two adjacent entries of C are very similar. See Figure 3. For t < j < m, Cij 
and Ci are computed as 

Cij = max{ max {Hi-tj-t+k + {t - k)a - 7 - fc^}, Ci-±,j - tji} , 

Cij+i = max{ max {Hi_t^^+i_t+k + (* - k)a - 7 - fc/r}, Ci_t,j+i ~ tlA ■ 



The differences between Cij and Cij+i are that (1) the element Hi-t,j-t+i + 
{t — l)a — 7 — ^ is deleted, ( 2 ) cr + /r is added to all elements except the last of 
Cij, (3) new element Hi-t,j+i — 7 — t/r is inserted, (4) the element Ci-tj — 
is deleted, and (5) the element Ci-tj+i — tfi is inserted. Thus Cij+i can be 
rewritten as 



C, 

An 



f + {t-k)a--f- kfj.}A 

ij+i = max < + > 

I Ci-t,j+l — (T — {t + l)/i J 

a insignificant entry Hi n_i_i can also be rewritten as follows 



+ (T + ^. (1) 



H, 



hi+i 



m&x^{Ci-tj-t+k + (t- k)a - 7 - kqi}, '' 

+ {t- l)a - fi, Cij+i - a - fi,-a - fi, 



+ (T + ^. 



Similarly, for 1 < j < t, we can compute Cij+i and Hi^+i using Cij and Hij, 
respectively. 

We need a new data structure to find maximum values in 0(1) time. Be- 
cause the element Hi-t,j-t+i + {t ~ l)o' — 7 — ^ is deleted in Recurrence (1), 
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(a) b (b) 

0 1 2 j j+1 



i-t 0 I I • i-t 

a : a 

i 0 n;j 

Fig. 3. (a) 1 < J < J- To compnte Cij, we need and for 1 < k < j 

and to we need Hi-j- 2,0 ~ 0 and Hi-t,k for 1 < < j + 1- (b) t < j < m. 

To compute Cij and Cij+i, we need Hi-t,j-t+k and Hi-tj+i-t+k, respectively, for 
Kk<t. 



0 j-t+i 



j j +1 




m&xi<k<t{Hi-tj-t+k + (i ~ k)a — 7 — kfx} which is already computed in Cij 
cannot be directly used in computing Cij+\ . However, if we know the maximum 
value for the range of k such that 2 < fc < t, it can be used in Cij+i. We will 
find the maximum value using MQUEUE. 

We define a specialized queue data structure, MQUEUE, that supports the 
following three operations: 

maxO : Return the maximum score among all elements in MQUEUE. 
insert (e) : Add an element e to MQUEUE. 
delete 0 : Remove the oldest element in MQUEUE. 

The maxO operation is added to the traditional queue. 

To obtain 0(1) amortized time for each operation, MQUEUE maintains a 
list of maximum candidates. Maximum candidates are defined as follows: Given 
a set of ordered elements, the first maximum candidate is an element that has 
the maximum score in the list. If the ith maximum candidate is defined, then 
the (i + l)st maximum candidate is an element that has the maximum score 
in the range from the next element of the ith maximum candidate to the last 
element of the list. Figure 4 (a) shows a list of maximum candidates. 

The list of maximum candidates is maintained as a doubly linked list. A 
pointer head points to the maximum element of MQUEUE and tail points to 
the last one. The important property of the list is that the scores of the elements 
are in descending order. 



(a) 

score 

index 



head 



(b) 



head 



20 


30 


15 


20 


15 


5 


10 


score 


20 


30 


15 


20 


15 


5 


10 


25 


3 


4 


5 


6 


7 


8 


9 


index 


3 


4 


5 


6 


7 


8 


9 


10 



Fig. 4. An example of MQUEUE. (a) The list of maximum candidates is 30 ^ 20 — > 
15 — > 10. (b) After insert (25), the list is modified to 30 — > 25. 
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j-s j-1 j 



-t-s+1 




-- 




i-t 









Fig. 5. Two MQUEUEs for Hi-tj and Hi-tj-i- 

Operation insert (e) needs to maintain the list property. After adding an 
element e, the score of e is compared with the maximum candidates from tail 
until it meets the score that is larger than the score of e or the list becomes 
empty. When it meets a larger one, they link each other and tail points to e. 
When the list becomes empty, we make a new list which has one element e and 
head and tail point to e. Figure 4 (b) shows an example of insert (e). 

Operation delete () removes the oldest element from MQUEUE. If head 
points to the oldest element, then head is set to the next element in the list and 
we remove the oldest element. Operation maxO returns the score of the element 
which is pointed to by head. 

The algorithm for one block of insignificant entries with t rows consists of 
two for loops. The first for loop computes Hij for 1 < j < t and the second for 
loop computes Hij for t + 1 < j < m. We use two MQUEUEs, one for Ctj and 
the other for Hij. In addition, we compute Rij because it is used in Case 4. 

4.2 Case 4 

We will show how to compute the bottom-right corner entry Hij of a t x s block 
and how to compute the row on top of the block and the column to the left of 
the block. Let Oi-t+iy and bj-s+i,j be masked regions, i.e., ai-t+i,i =NN. . .N 
and bj_sj^ij =NN. . . N. 

Lemma 8. If Hij comes from Ci-tj-k for 1 < k < min{s,t} — 1, Hij comes 
from Ci-t,j or Hi-tj-u for 0 < u < min{s, t} — 2. 

Lemma 9. If Hij^.^ comes from Ci-tj+r-k forr+1 <k< min{min{s,t} — 1-1- 
r, t — 1}, Hij+r comes from Ci-tj or Hi-t,j+r-u for r < u < min{min{s,t} — 
2 -f- r, t — 2} . 



Lemmas 8 and 9 show that in order to compute the bottom-right corner entry 
Hij, we need Hi-t,j-k for 0 < k < t and only Ci-tj (i.e., we don’t need to 
compute Ci-t,j-k for ^ < k < t). Cij also can be computed similarly to Hij. 
Since Ci-t,j is already computed in Case 3, we will show below how to compute 

Hi—t,j — k ■ 

For 1 < A: < minjt, s} — 1, we compute Hi-tj-k (row E4 in Figure 2) using the 
MQUEUE that was constructed to compute Hi-tj in Case 3, i.e., we compute 
Hi_t j_k from right to left. See Figure 5. The difference between two MQUEUEs 
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for Hi-tj and Hi-tj-i is that the oldest element, Ri-t-s+i,j-s, in MQUEUE 
for is deleted. Thus, after one delete, we have MQUEUE for Hi-tj-i- 

Of course, to compute M is added to max. Similarly, we can compute 

Hi-k,j-s for 1 < fc < min{t, s} — 1 (column in Figure 2). 

The entries in row E 4 of Figure 2 will be used in Case 2 which is to the right of 
the current block. To compute the entry in Case 2, we need 

and Ci-t,j+i-t+i..j+i among which and Ci-tj+i-t+i..j are the en- 

tries that are lain outside of the block of Case 2. We already have computed 
in Case 4 which is to the left block of the Case 2. For C, we need 
only Ci-tj by Lemma 9 and already have it. Similarly, the entries in column E^ 
of Figure 2 will be used in Case 3 which is below the current block. 



5 Analysis 

Consider the time complexity for each of the four cases. In Case 1, the algorithm 
takes 0{pq) time for a p x g block because we use the SWG algorithm to compute 
Efij. Since Case 3 is symmetric to Case 2, we consider Cases 2 and 4. 

In Case 2, the algorithm takes 0{q) time for a t x g block. There are q + 1 
insert, q max and q—t delete operations. Because max and delete take constant 
time, q max and q — t delete operations take 0{q) time. For insert, g-l-l insert 
operations add q+1 elements. Moreover, each insert makes some comparisons 
to maintain the list of maximum candidates. If there are c comparisons in one 
insert, the number of elements in the list of maximum candidates is reduced 
by c — 1. Hence, each insert takes amortized 0(1) time and thus the total time 
for Case 2 is 0(g). 

In Case 4, the algorithm takes 0(min{t, s}) time for a t x s block. Because 
it computes one row and one column with minjt, s} entries and the operations 
of MQUEUE take amortized 0(1) time, it takes 0(min{t, s}) time. 

Now consider the worst case time complexity of our algorithm. The worst 
case is that the lengths of all masked regions of a and b are the same (say, t). 
Let V and w be the numbers of masked regions of a and b, respectively, and let T 
and S be the total numbers of N in a and b, respectively (i.e., T = vt, S = wt). 
Case 1 covers {n — T){m — S) entries and it takes 0{{n — T){m — S)) time. Since 
Case 2 covers vtx{m — S) entries. Case 3 covers wt x {n — T) entries, and Case 
4 covers vt x wt entries, they take 0(vm + wn) time. Therefore, the total time 
complexity is 0((n — T){m — S') -I- vm + wn). 

6 Algorithm for Linear Gap Penalty Model 

Because the linear gap penalty model is a special case of the affine gap penalty 
model, it can be solved by the algorithm for the affine gap penalty model. From 
Lemmas 2,6, and 7, however, we derive the following theorem which provides a 
simpler recurrence for an insignificant entry Hij. 
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Theorem 2. 

for 1 < j <t 
for t < j < m. 

Using the new recurrence of Theorem 2, we can give a local alignment algo- 
rithm for the linear gap penalty model that is similar to the algorithm for the 
affine gap penalty model, and the time complexity is also 0((n — T)(m — S') -I- 
vm + un). 
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Abstract. The Degree-A Closest Phylogenetic Icth Root Prob- 
lem (ACPRfc) is the problem of finding a (phylogenetic) tree T from a 
given graph G — (V,E) such that (1) the degree of each internal node 
of T is at least 3 and at most A, (2) the external nodes (i.e. leaves) of 
T are exactly the elements of V , and (3) the number of disagreements, 
\E © {{m,w} : u,v are leaves of T and (It(u,v) < fc}| does not exceed 
a given number, where cIt{u,v) denotes the distance between u and v 
in tree T. We show that this problem is NP-hard for all fixed constants 
A, fc > 3. 

Our major technical contribution is the determination of all pylogenetic 
roots that approximate the almost largest cliques. Specifically, let SA{k) 
be the size of a largest clique having a fcth phylogenetic root with maxi- 
mum degree A. We determine all the phylogenic fcth roots with maximum 
degree A that approximate the (szi(fc) — l)-clique within error 2, where 
we allow the internal nodes of phylogeny to have degree 2. 



1 Introduction 

A phylogeny is a tree where the leaves are labeled by species and each inter- 
nal node represents a speciation event whereby an ancestral species gives rise 
to two or more child species. The internal nodes of a phylogeny have degrees 
(in the sense of unrooted trees, i.e. the number of incident edges) at least 3. 
Specifically, interspecies similarity is represented by a graph where the vertices 
are the species and the adjacency relation represents evidence of evolutionary 
similarity. A phylogeny is then reconstructed from the graph such that the leaves 
of the phylogeny are labeled by vertices of the graph {i.e. species) and for any 
two vertices of the graph, they are adjacent in the graph if and only if their 
corresponding leaves in the phylogeny are connected by a path of length at most 
k, where fc is a predetermined proximity threshold. To be clear, vertices in the 
graph are called vertices while those in the phylogeny nodes. Recall that the 
length of the (unique) path connecting two nodes u and v in phylogeny T is the 

* Supported in part by the Grant-in-Aid for Scientific Research of the Ministry of 
Education, Science, Sports and Culture of Japan, under Grant No. 14580390. 
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number of edges on the path, which is denoted by driu, v). This approach gives 
rise to the following algorithmic problem [5]: 

Phylogenetic kTu Root Problem (PR/c): 

Given a graph G = {V, E), find a phylogeny T with leaves labeled by the 
elements of V such that for each pair of vertices u,v G V, {tt, w} € if if and 
only if dT{u,v) < k. 

Such a phylogeny T (if exists) is called a phylogenetic kth. root, or a fcth root 
phylogeny, of graph G. Graph G is called the /cth phylogenetic power of T. For 
convenience, we denote the /cth phylogenetic power of any phylogeny T a,sVk(T). 
That is, Vk{T) has the vertex set L{T) = {u : u are leaves of T } and the edge 
set = {{u,u} I u and v are leaves of T and dT{u,v) < k}. Thus, FKk asks 
for a phylogeny T such that G = Vk{T). 

The input graph in PRfc is derived from some similarity data, which is usually 
inexact in practice and may have erroneous (spurious or missing) edges. Such 
errors may result in graphs that have no phylogenetic roots and hence we are 
interested in finding approximate phylogenetic roots for such graphs. For a graph 
G = {V,E), each tree T whose leaves are exactly the elements of V is called an 
approximate phylogeny of G, and the error ofT is \T^®E\ = |(if— T'')U(T^— if)|. 
This motivated Ghen et. al. to consider the following problem: 

Closest Phylogenetic /cth Root Problem (CPRfc): 

Given a graph G = {V, E) and a nonnegative integer £, decide if G has an 
approximate phylogeny T with at most £ errors. 

An approximate phylogeny of G with the minimum number of errors is called 
a closest kth root phylogeny of graph G. 

The hardness of PR/c for large k seems to come from the unbounded degree 
of an internal node in the output phylogeny. On the other hand, in the practice 
of phylogeny reconstruction, most phytogenies considered are trees of degree 3 [7] 
because speciation events are usually bifurcating events in the evolutionary pro- 
cess. We call these restricted versions the Degree- Z\ PRfc and the Degree- Z\ 
CPRfc problems, and denote them for short as Z\PRfc and ACPRfc, respectively. 

1.1 Previous Results on Phylogenetic Root Problems 

PRfc was first studied in [5] where linear-time algorithms for PR2 and PR3 
were proposed. A linear-time algorithm for the special case of PR4 where the 
input graph is required to be connected, was also presented in [5]. At present, 
the complexity of PRfc with fc > 5 is still unknown. 

Chen et. al. [2] presented a linear-time algorithm that determines, for any 
input connected graph G and constant A > 3, if G has a fcth root phylogeny 
with degree at most A, and if so, demonstrates one such phylogeny. On the other 
hand, Chen et. al. [2] showed that CPRfc is NP-complete, for any fc > 2. One of 
their open questions asks for the complexity of ZlCPRfc. 
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Of special interest is CPR 2 . The problem CPR 2 is essentially identical to 
the correlation clustering problem which has drawn much attention recently [1]. 
The proof of the NP-hardness of CPR 2 given in [2] is also a valid proof of the 
NP-hardness of the correlation clustering problem. Blum et. al. [1] rediscovered 
the NP-hardness of the problem. 

1.2 Our Contribution 

In this paper, we will show that Z\CPR/c is NP-complete, for any k > 3 and 
Z\ > 3. This answers an open question in [2]. 

In a course of the proof we first reduce Hamiltonian Path, a famous NP- 
complete problem, to Z\CPR3, and then Z\CPR3 to Z\CPR/c. The former re- 
duction is tedious but a routine work. On the other hand, the latter reduction 
seems to require new combinatorial investigation that is proper of Z\CPRfc. 

A {A, k, h, £)-core graph is a graph G = {V, E) with the following properties: 

— There is a tree T of maximum degree A whose phylogenetic A:th power is G 
and such that T has a unique unsaturated {i.e. degree < A) internal node a, 
the degree of a is A — 1, dria, u) = h holds for one leaf u and dria, v) > h+1 
hold for all leaves v other than u. 

— For every approximate phylogeny T of G with maximum degree A and at 
most £ errors, there is at most one pair (a, u) such that a is an unsaturated 
internal node of T, uis & leaf of T, and dT{a,u) < h; moreover, if {a,u) 
exists then the degree of a in T is Z\ — 1. 

Then, we establish the reduction from Z\CPR3 to Z\CPR/c by providing 
a family of {A,k, [fc/2j — l,2)-core graphs for every fixed A > 3 and fc > 4. 
Our construction of a {A, k, [k/2\ — 1, 2)-core graph is a pile of {A, k' , 1, 2)-core 
graphs for A:' = 5, 7, . . . , fc if A: > 5 is odd, and A;' = 4, 6, . . . , A: if A; > 4 is even. 
So, a more basic problem is to prove that a certain graph is a (Z\, A:, 1, 2)-core 
graph. 

The maximum size of a clique having a (no-error) A:th root phylogeny with 
maximum degree A is given by the following function, 

j A ■ {A — 1)^“^, if k is even, 

^ 'I 2 • (Z\ — 1) “ 2 “ , if A: is odd. 

We prove that the clique of size S/\(A:) — 1 is a {A, k, 1, 2)-core graph. Moreover, we 
determine the all A:th root phytogenies with maximum degree A that approximate 
the clique within error 2, where we allow the internal nodes of phylogeny to have 
degree 2. For example, all phylogenetic roots of the ( 53 ( 5 ) — l)-clique are D 5 in 
Figure 1, in Figure 2, and the tree obtained from by removing the leaf u. 

2 Notations and Definitions 

We employ standard terminologies in graph theory. In particular, for a graph 
G, V{G) and E{G) denote the sets of vertices and edges of G, respectively. 
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Two graphs G = (V, E) and G' = (V', E') are isomorphic if there is a one- 
to-one correspondence (p between V and V' such that {u,u} G E if and only 
if {ip{u) , (p{v)} G E', and we denote it as G G' . Let G be a graph. Let u 

and V be two vertices in G. The distance between u and u in G is the number 
of edges in a shortest path from u to w in G, and is denoted by dciu^v). The 
neighborhood of f in G is the set of vertices adjacent to v in G, and is denoted 
by Nc{v). The degree of u in G is and is denoted by daiv). Similarly, 

for a tree T, V{T), E{T), and L{T) denote the sets of nodes, edges and leaves 
of T, respectively. 

We also introduce some new terminologies of trees for convenience. For a tree 
T of maximum degree A, an internal node a of T is unsaturated if dT(oi) < A — 1. 
Tree T is i-extensible Hi = '^„{A — degTiv)) , where the summation is taken over 
all unsaturated internal nodes u of T. A tree T is h-away if for each unsaturated 
internal node x of T, the minimum distance from a; to a leaf is at least h and 
further there is exactly one leaf Vx such that dT{x,Vx) = h. For any set U of 
nodes of T, T[U] denotes the minimum subtree containing U. Note that each 
leaf of T[U] belongs to U. A phytogeny is a tree that contains no degree-2 nodes. 
As already mentioned, the fcth phylogenetic power of any tree T is denoted as 
Vk{T) = (L(r),r^), where is the set of all edges {u,v} with {u,v} C L{T) 
and driu, v) < k. 

3 Construction of [fc/2j — l,2)-Core Graphs 

In this section we give a construction of (3, k, [k/2\ — 1, 2)-core graphs for every 
odd /c > 5. It is straightforward to generalize the arguments of this section to 
obtain (A, k, [k/2\ — 1, 2)-core graphs for every A > 3 and fc > 4. 

Throughout this section, all trees and phytogenies are of maximum degree 
3 or less. We abbreviate S 3 (fc) as s{k). The proofs of the most lemmas and 
corollaries are omitted due to lack of space. 

Obviously, up to isomorphism, there is exactly one tree of maximum degree 
3 whose fcth phylogenetic power realizes an s(/c)-clique. We denote this tree by 
Gfe. Similarly, up to isomorphism, there is exactly one tree of maximum degree 
3 and having exactly one degree-2 node whose /cth phylogenetic power realizes 
an (s{k) — l)-clique. We denote this tree by Dk- Figure 1 depicts D^, D^, and 

L>6. 

Lemma 1. For every tree T (of maximum degree H), if there are two leaves u 
and V with dT{u,v) = k and all leaves w of T have distance at most k from both 
u and V, then T is isomorphic to a subtree of Ck- 

Corollary 1. For any tree T, if dT{u,v) < k for all leaves u and v, then T is 
isomorphic to a subtree of Ck ■ 

Fact 1 For every tree T with \L{T)\ = s{k) — 1 and |T^| > ~ have 

driu^v) < k for all but at most two unordered pairs {u,u} of leaves of T . 
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Fig. 1. Di, Di and D^. 



Lemma 2. Let fc > 4. Let T he an arbitrary tree such that \L{T)\ = s{k) — 1 
and |T^| > ~ Then, there are leaves u and v of T with dxiujv) = k. 

Lemma 3. Let k > 6. Let T he an arbitrary tree having s{k) — 1 leaves. Sup- 
pose that there are three leaves u,v,w of T such that dT{u,v) = k and further 
vo.&'x.{dT{u,w),dT{v,w)) > k -\- 1. Then, |T^| < ~ 3. 

Lemma 4. Let k > 6. Let T be a tree having s{k) — 1 leaves such that \T^\ > 
ThenT is 0- extensible or 1- extensible. Moreover, ifT is 1-extensible 
then T = Dk. 

Proof. By Lemma 2 we have two leaves u and v of T such that dT{u,v) = k. 
Moreover, by Lemma 3, all leaves w of T partake distance at most k from both u 
and v. So, by Lemma 1, T is isomorphic to a subtree of Ck, where |L(C^)| = 
\L{Ck) \ — 1. Since all internal nodes of Ck have degree 3, is obtained from Ck 
by removing one leaf, or two sibling leaves. In the former case, T = Dk, while in 
the latter case, T is 0-extensible. 

For k € {4,5}, let Ek be the tree in Figure 2. 




Fig. 2. E 4 and F 5 . 



Lemma 5. Let k G {4,5}. Let T he a tree having s{k) — 1 leaves such that 
\T^\ > ~ Then T is 0-extensible or 1-extensible. Moreover, ifT is 

1 -extensible then T = Dk or T = Ek. 

Proof. By lemma 2, T has leaves u and v with dT{u,v) = k. Let S = {w & 
L{T) : > fc+ 1 or > fc+ 1}. By assumption, 0 < [S'! < 2, and 
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Fact 1 holds. By Lemma 1, T[L{T) — S] is isomorphic to a subtree C'^ of Ck- Let 
Lp be the isomorphism. Root T at v and Ck at ip{v). Let tt" be the grandparent 
of ip{u) in Ck- The proof proceeds in three cases. 

Case 1: [S'! = 0, z.e. S' = 0. In this case, T = C'f.. So, either T = Dk or T is 
0-extensible. 

Case 2: |S| = 1. Let S = {w}. Without loss of generality, dT(u, w) > dT{v,w). 
Let w' be the lowest ancestor of w with (1t{w') = 3. 

Subcase 2.1: w is not a descendant of Root Ek at the non-leaf node 

adjacent to v. We show that T = Ek, where leaf u of Ek corresponds to leaf w 
of T, the uncle of u in Ek corresponds to leaf v of T, and a child leaf of v in Ek 
corresponds to u (see Figure 2). First note that for all leaves x of Ck that are 
descendants of u” , ii x & C'j. then (1t{x,w) > A: -I- 1. By Fact 1, at most two of 
them can belong to L{C'f.). Moreover, at least one leaf descendant of <p{w') in 
Ck does not belong to L{C'^). Now, since \L{T) — {w}| = |L(C(,)| = s{k) — 2, 
we have that the sibling and uncle of u belong to L{C'f.), and (1t{u, w) = k + 1. 
Consequently, T = Ek- 

Subcase 2.2: w is a descendant of Since d,T{u,w) > dT{v,w), we 

have fc = 4 and dT{v,w) = dT{u,w) > fc -|- 1 = 5. By Fact 1, the leaves of C4 
that are of distance 2 from either fp{u) or fp{y) do not belong to C4. Moreover, 
so does at least one descendant of ‘p{w’)- Therefore, s(4) — 2 = \L{T) — {w}| = 
|T(C'4)| < s(4) — 3, a contradiction. 

Case 3: [S'] = 2. Let S = {w,y}- In this case, we show that T = Ek, where 
leaf u of Ek corresponds to that of T, the leaf of Ek whose distance from v is 
2 corresponds to leaf u of T and the two leaves of Ek adjacent to v correspond 
to w,y (see Figure 2). We may assume dT{u,w) > dT{v,w)- Then, w is not a 
descendant of otherwise we could get a contradiction as in Subcase 2.2. 

Similarly, y is not a descendant of Let w' be the lowest ancestor of w 

in T that is a node of T[L{T) — S']. We already have two nodes in S that have 
distance greater than k from m or w in T. So, by Fact 1, none of the sibling nor 
cousins of belong to L{C'f.)- In addition, so does at least one leaf descendant 
of y}{w')- Thus, s{k) — 3 = \L{T) — S| = |L(C'(,)|. Consequently, the uncle of 
ip{u) belongs to L{C'f.), w and y are the only descendants of w' that belong to 
L{Ck) — L{C'f.), and dr{u, w) = dx{u, y) = k+1- So, T = Ek- 

Corollary 2. For every A: > 4, an {s{k) — l)-clique is a (3, A;, l,2)-core graph- 

Now we are ready to construct a (3, k, [k/2\ — 1, 2)-core graph for every odd 
k > 5- We recursively construct trees Rk,[k/ 2 ]-ij k = 5,7,9,..., and define a 
family of {3,k, [k/2\ — l,2)-core graphs as the A:th phylogenetic power of the 
trees (see Figure 3 for Ry ^2 )■ 

— Let hk = [A;/2J — 1. For each 1 < i < hk,iet g{i) = rij=i(s(2j + 3) — 1). Let 

ff(0) = 1. 

— Rk,hk is a leveled tree of depth hk such that at depth z (0 < A < hk) are 

placed g{i) nodes and each node v at depth i < hk is connected to some 

s(2z -I- 5) — 1 nodes at depth z -|- 1. 
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— Rk,hk is an expansion of Rk,hk such that each node v of Rk,hk depth i 
(0 < i < /ife — 1) is expanded to a copy D{v) of I? 2 i +5 in Rk,hkJ where v is 
identified with the degree-2 node of D{v) and the child nodes of v in Rk,h;, 
are identified with the leaves of D(v) in an arbitrary one-to-one manner. 

By construction, Rk,ht: is 1-extensible and /i^-away, and has a unique degree-2 
node, namely, the unique degree-2 node of D^. 

Lemma 6. Let k > 4. Let T he a 0-extensible tree, having s{k) — 1 leaves, and 
|yfc| > ~ 2. Let F be the tree obtained by connecting a new leaf to a leaf 

w ofT. Then, - 3. 

Lemma 7. Let k > 5 be odd. Let T he a tree such that L(T) = L(Rk) and 
\T^ 0 i?fc| < 2. Then, T is 0-extensible or 1-extensible. Moreover, if T is 1- 
extensihle then it is hk-away. 

Proof. By induction on A: > 5. The case k = 5 has been done in Lemma 5, 
because i?s = D^. So we fix A: > 7. Let R = Rk, R = Rk and h = hk, for 
simplicity of notations. Let Ti be the set of nodes of R at depth i. Let Ly = 
L{D{v)) for every non-leaf node v of R. So, L{R) = Th = U„grfc_iL„. Consider 
an arbitrary tree T such that L(T) = L{R) and \T^ 0 i?^| <2. Then, for every 
v G Th-i, Vk{R[Ly\) is an {s{k) — l)-clique and |T^[L„] 0 < 2, so by 

Lemma 4, T[Ly] is 1-extensible and 1-away. Let fpiy) be the degree-2 node of 
T[Ly\. Let ay (respectively, /3„) be the unique leaf of D{v) (respectively, T[Ly\) 
such that dn{v,ay) = 1 (respectively, dri^piv), (iy) = 1). We may assume T is 
not 0-extensible. 

We claim that ay = (3y for all v G Th-i. For a contradiction, we assume 
Oiv Pv for some v G Tk-i. Let v' G Th -2 be the parent node of v in R. By 
construction, Vk- 2 {R[Ly']) is an (s(A:— 2) — l)-clique, soVk{R[{au ■ u G Lyi}]) is 
an (s(A:-2)-l)-clique. Since |T'=0i?'=| < 2, |T'=[{a„ : u G Ly,}]\ > 

2. In addition, driipiu), a„) > 1 for all u G Lyi, so if we let Tq = T[{lp{u) : u G 
Lyi}], then |Tg — 2. Lemma 4 (or Lemma 5 for A: = 7) therefore 
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determines the topology of Tq. Particularly, To is not 0-extensible; otherwise the 
subtree T[U„gi^,L„] would become 0-extensible since every T[T„] with v G L^i 
is 1-extensible. Let F be the tree obtained by connecting a new leaf to (p{v). By 
Lemma 6, |T^| < ~ while we have also — 2 since 

Oiv ^ Pv and |T^[{a„ : u G T„/}]| > — 2, a contradiction. 

Now, ay = Py for all v G T/i-i- Define a tree T' with L(T') = Fh-i 
from T as follows: For every v' G Fh-i, contract T[Ly] to (p{v) and iden- 
tify Lp{v) with V. Because dT{'^{v),ay) = 1 for all v G T^-i, dT'{u,v) = 
drioimOty) — 2 for all u,v G Fh-i- In addition, dn{u,v) = oiy) — 2, 

hence |(T')^“^0i?^“^[T^_i]| < 2. The induction hypothesis therefore shows that 
T' is 1-extensible and /ife_ 2 -away. Moreover, all T[T„], v G Fh-i, are 1-extensible 
and 1-away, so T is 1-extensible and hfc-away. 

Theorem 1. For every odd k>5, the graph Vk{Rk,\k/ 2 \-i) o (3, A:, [A:/2J — 
1,2) -core graph. 

These constructions, lemmas and theorems can be generalized to every fixed 
fc > 4 and Z\ > 3. A phylogenic fcth root of the s/i(fc)-clique can be constructed 
in the same way as and we denote it as Dap- We can construct a phylogeny 
RA,k,hh of degree A recursively in the same way as Rk,hk but replacing s and Di 
therein with sa and Da.i, respectively; further, if k is even then the function g{i) 
therein should be replaced by n}=i('®zi(2j -I- 2) — 1). Lemma 7 and Theorem 1 
can be generalized as follows: 

Theorem 2. Let fc > 4 and Z\ > 3. Let T he a tree of maximum degree A 
such that L(T) = L{RA.k,hk) o.'nd \T^ 0 khj\ — Then T is 0-extensihle or 
1-extensible. Moreover, if T is 1-extensible then it is hk-away. 

Theorem 3. For every fc > 4, Vk{RA,k,[k/ 2 ]-i) is a {A,k, [k/2\ — l,2)-core 
graph. 

4 The NP-Hardness of /ICPRfc 

This section proves that 3CPRfc is NP-hard for each odd A: > 3. It is straight- 
forward to generalize the arguments of this section to prove that ACPRA: is 
NP-hard for for every Z\ > 3 and fc > 3. 

Throughout this section, all trees and phytogenies are of maximum degree 3 
or less. Proofs of most lemmas and corollaries are omitted due to lack of space. 

We begin with the NP-hardness proof of 3CPR3 because the NP-hardness 
proofs of 3CPRA: for larger odd k are reductions from it. We reduce the following 
version of Hamiltonian Path Problem (HP) to 3CPR3, whose NP-hardness 
proofs can be found in [3] and [6, Section 9.3]. 

Hamiltonian Path Problem (HP): Given a graph G = (V,E) such that 

— all vertices are of degree 3 or less, 

— two specific vertices are of degree 1 and each of them is adjacent to a vertex 
of degree 2, and 

— there is no cycle of length less than 5, 
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find a Hamiltonian path of G, i.e., find a linear ordering of the vertices of G such 
that each pair of consecutive vertices are adjacent in G. 

For every graph G = (V, E) and every approximate phylogeny T of G, we 
define a function fc,T mapping each v € V to a, real number as follows: 

fG,T{v) = ^\{u : {m,u} G EanddT{u,v) > 3}| 

+ |{{u,w} : u ^ w, {u,v} € E, {v,w} € E, {u,w} ^ E anddriu^w) < 3}\ . 
Lemma 8. Y.vav fG,T{v) < \T^®E\. 

Lemma 9. Let v he a vertex of G having three pairwise nonadjaeent neighbors 
ui,U2 and U3. Then, fa,T{v) = \ or fG,T{v) > 1. Moreover, if fG,T{v) = \, 
then dT{ui,v) > 3 for one Ui G {mi,U2,U3} and driujjv) = 3 for the other two 
Uj G {UI,U2,U3,} - {mJ. 

Lemma 10. 3CPR3 is NP-complete. 

Proof. Let G = (V,E) be an arbitrary instance of HP. Let \V\ = n + 2. Let 
W C V he the set of vertices of degree 3 in G. Then |1F| is even and we denote 
it as 2£. Let vq and Vn+i be the two degree-1 vertices, and let vi and be the 
unique neighbors of Vq and v„ in G, respectively. Note that dc(vi) = dc(v„) = 2. 
Let G' = (V',E') be the graph obtained from G as follows. 

1. Add two new vertices vg and Vn+i- 

2. Add four new edges {uo,ho}, {uo,ui}, {v„+i,v„+i}, and {v„+i,v„}. 

It suffices to show that the input graph G has a Hamiltonian path if and only 
if G' has an approximate phylogeny T with error < £. 

Suppose that G has a Hamiltonian path vg, vi, V2, ■ ■ ■ , v„, v„+i, an ordering 
of the vertices where each pair of consecutive vertices are adjacent in G. An 
approximate phylogeny T of G' is then constructed as follows: T has 

1. n -I- 2 internal nodes Vq, . . . , 

2. n -I- 4 edges connecting the leaves to the internal nodes, namely, {uo,?;^}, 

and {vi, v'} for alH G {0, . . . , n -I- 1}, and 

3. n -I- 1 edges {v', for alH G {0, . . . , n} between the internal nodes. 

By the construction, \T^ 0 E'\ = |{{ui,Uj} G E : \i — j\ > 1}| = i. 

Conversely, suppose that T is an approximate phylogeny of G' that satisfies 
|r^ © E'\ < i. By Lemma 8, \T^ © E'\ > J2vev /g'.t(u) > J2vew /g'.t(u)- On 
the other hand, by Lemma 9, /g',t(v) > 5 for all v &W. So, £> \E' (B T^\ > 
\W\ /2 = £, where all inequalities must be equality. This shows that, 

(i) fG',T{v) = i for all v G W, and 

(ii) fG’,T{v) = 0 for all v &V — W . 

For each Vi G V, let u' be the internal node in T adjacent to Vi. We claim 
that Vq, . . . are different internal nodes of T. For a contradiction, assume 
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v[ = v'j for some Vi yf Vj. A contradiction is derived in each of the following 
cases. 

Case 1: Vi G W. Let ui,U 2 and U 3 be the three neighbors in G. If Vj G 
{u\,U 2 ,uz\, then fG>,T{vi) > 1 by Lemma 9. Otherwise, either m' = tt', or both 
driuijVi) > 3 and dT{uj,Vi) > 3 for some Ui yf Uj. In any case, fG',T{vi) > 1, a 
contradiction against (i). 

Case 2'. Vi G V — W — {uo,u„+i}. Let u\ and U 2 be the neighbors of Vi. If 
Vj G {ui,U 2 }, say U 2 = vj, and dT{ui,U 2 ) > 4, then fG',T{vi) > If M 2 = vj 
and dT{u\,U 2 ) < 3, then fG',T{vi) > 1- If vj ^ {ui, M 2 }, then either u} = u '2 or at 
least one of c?t(mi. Mi) or dT{u 2 , Vi) is larger than 3; In either case, fG',T{vi) > 5 - 
Therefore, in any case, we get a contradiction against (ii). 

Case 3: m, = vo and Vj = Vn+i - In this case, drivo, mi) > 4 or drivn, Vn+i) > 4 
holds. If drivQjVi) > 4, then /g',t(mq) > 5 . Similarly, if dT{vQ,Vn+i) > 4, then 
fG',T{vn+i) > 5 - In any case, we get a contradiction against (ii). 

Now, |{mq, . . . ,m(j_|_]^}| = n + 2. So, dxivijVj) > 3 for all Vi yf Vj. By Lemma 9 
and (i), for every Vi G W, v[ is adjacent to just two of the three nodes u[,U 2 , Mg 
in T where mi,U 2 ,U 3 are the neighbors of Vi in G. Moreover, by (ii), for every 
Vi G V — W — {vo, Vn+i}, v'i is adjacent to both of and u '2 where mi and M 2 are 
the neighbors of Vi in G. Consequently, the subtree T[{mq, . . . tv'^_^{\] becomes 
a Hamiltonian path starting from Mq and ending at which gives rise to a 

Hamiltonian path from vq to Vn+i in G. 

Theorem 4. For every odd k > 3, 3CPRA: is NP-complete. 

Proof. We have shown it for k = 3. Fix an arbitrary odd k > 5. By Theorem 1, 
we have a (3, k, h^, 2)-core graph Vk{Rk,hk) £^nd its kth phylogenetic root Rk,hk j 
where hk = [k/2\ — 1. Recall that Rk,hk has a unique degree-2 node a and there 
is a unique leaf u in Rk.h^. such that dji^ {ct,u) = hk- Moreover, for all leaves 
M yf u of Rk,hk , dria, v) > hk- We refer to this leaf u as the special leaf of Rk,hk 
and the corresponding vertex of Vk{Rk,hk) ^ts the special vertex of Vk{Rk,hk)- 
Let G = (V,E) be an instance of HP. We inherit the notions used in the 
proof of Lemma 10 (e.g. n = |P | — 2 and 2£ is the number of degree-3 vertices in 
G.) Let G' = {V ,E') be the graph constructed from G as in Lemma 10. Further, 
we construct a graph G" = {V" , E") from G' as follows: 

1 . Replace every vertex v in V with a copy G{v) of Vk{Rk) and identify v with 
the special vertex of G(m), referring to it as the special vertex of G{v). 

2. Let E" D E' , i.e. connect a pair of special vertices by an edge of G" if and 
only if the corresponding pair are adjacent in G' . 

Therefore, V C V" is the set of special vertices of G" . 

It suffices to show that the input graph G has a Hamiltonian path if and only 
if G" has an approximate phylogeny with error < £. 

Suppose that G has a Hamiltonian path mq, mi, M 2 , . . . , m„, m„+i, an ordering 
of the vertices of G. Let T be the approximate phylogeny of G' constructed in 
Lemma 10 from this Hamiltonian path. A required approximate phylogeny T" 
of G" is then reconstructed as follows: 
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1. Replace each leaf v & V oi T with a copy T{v) of tree Rk,hk and connect 
the degree-2 node of T{v) to the node adjacent to v in the original T. 

2. Label the leaves of T{v) by the vertices of G{v) in an arbitrary one-to-one 
manner such that Vk{T{v)) = G{v) and v is the special leaf of T{v). 

Conversely, let T be an arbitrary approximate phylogeny of G" with error 
< Our goal is to show the existence of a Hamiltonian path of G. 

Let Ly = R(G(u)). We define a function mapping each u G W to a real 
number as follows: 



fk{v) 



= i |{u G -/Vg/(u) : dT{u,v)>k}\ 

|{{u,w} : u ^ w,v & Ngi{u) Ng'{w),{u,w} ^ E' aYiddriu^w) < k}\ 



E 

UGNq/ (i!)U{i'} 



T>^[Lu](BE{G{u)) 
dG' (u) + 1 



Lemmas 8 and 9 still hold after replacing the function fG,T therein by the func- 
tion fk here. The proofs remain the same. 

We claim that fk (v) > ^ for every degree-3 vertex v of G. For a contradiction, 
assume fk{v) < Let ui,U 2 ,us be the neighbors of v in G. For every Ui, 
0 E{G{ui))\ < 2; otherwise we would have fk(v) > So by Theorem 1, 
is 1-extensible and hk-s^way. Similarly, T[Lj;] is 1-extensible and hk-away. 
By definitions, these four 1-extensible subtrees T[L„2], T[L„3] and T[L„] 

are mutually disjoint. Let Oj be the degree-2 node of and a' be its neighbor 

outside T[L„J. Similarly, let j3 be the degree-2 node of T[Lf\ and (3' be its 
neighbor outside T[L„]. Then, |{ai, «2, oa, /?}| = 4, and by arguments in the 
proof of Lemma 9, |{aj, /?^}| = 4. We have dT{ui,ai) = hk for every uf, 

otherwise drivjUi) > k+1 hence we would have fk{v) > 5- Similarly, dT{v,(3) = 
hk- Therefore, 7^3(T)[{ai, 02, as, /?}] = 'Pk(T)[{ui,U 2 ,U 3 ,v}]. Consequently, by 
the proof of Lemma 9, fk{v) > a contradiction. 

Now, fk{v) > ^ for every degree-3 vertex v of G. Together with the upper 
bound, we have 

(i) fk{v) = 5 for all vertices v of degree-3 in G and, 

(ii) fk{u) = 0 holds for all other vertices u in V . 

Hence T[Ly] is 1-extensible and hk-away for every v G V . Let be the 
unique degree-2 node of the subtree T[L„]. We have shown that driv, ay) = hk 
for all degree-3 vertices v of G; further, by (i) and (ii), dT{v,ay) = hk for all 

V G V'. Thus, 7^3(T[{a„ : v G R'}]) = Vk(T[V']). Let T" be the tree obtained 
from T by contracting every T[L„] to and identifying with v, for every 

V G V . Then, \T"^ 0 E'\ < l\ we know that G has a Hamiltonian path, as in the 
proof of Lemma 10. 



It is straightforward to generalize Theorem 4 to every Z\ > 3 and fc > 4, 
obtaining the following theorem. 

Theorem 5. For every Z\ > 3 and every k >3, ZlCPRA: is NP-complete. 
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5 Summary and an Open Question 

We have proved that Z\CPRA: is NP-complete for every Z\ > 3 and k > 3. A 
more fundamental problem is the Tree km Root Problem (TR/c), where 
the nodes (not only the leaves) of T correspond to the vertices of G. Kearney 
and Corneil proved that CTR/c is NP-complete when fc > 3 [4]. We conjecture 
that Z\CTRfc is NP-complete for every fixed Z\ > 3 and k >2. 
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Abstract. Given a set T of rooted triplets with leaf set L, we consider 
the problem of determining whether there exists a phylogenetic network 
consistent with T, and if so, constructing one. If no restrictions are placed 
on the hybrid nodes in the solution, the problem is trivially solved in 
polynomial time by a simple sorting network-based construction. For 
the more interesting (and biologically more motivated) case where the 
solution is required to be a level- 1 phylogenetic network, we present 
an algorithm solving the problem in 0(n®) time when T is dense (i.e., 
contains at least one rooted triplet for each cardinality three subset of L), 
where n — \L\. Note that the size of the input is 0{n^) if T is dense. We 
also give an 0(n®)-time algorithm for finding the set of all phylogenetic 
networks having a single hybrid node attached to exactly one leaf (and 
having no other hybrid nodes) that are consistent with a given dense set 
of rooted triplets. 



1 Introduction 

A phylogenetic network is a generalization of a phylogenetic tree in which internal 
nodes are allowed to have more than one parent. Phylogenetic networks are 
used to represent evolutionary relationships that cannot be adequately described 
in a single tree structure due to evolutionary events such as recombination, 
horizontal gene transfer, or hybrid speciation which suggest convergence between 
objects [8,9,17,18,20]. In fact, these types of events occur frequently in the 
evolutionary history of certain groups of organisms [17], but not much is known 
about the combinatorics related to phylogenetic networks [8]. Hence, to develop 
conceptual tools and efficient methods for computing with phylogenetic networks 
is an important issue. 

Some methods for constructing and for comparing phylogenetic networks 
have been proposed recently [4, 8, 17, 18, 20]. In this paper, we consider the prob- 
lem of constructing a phylogenetic network from a set of rooted triplets (see 
below for a formal problem definition). In particular, we assume that the input 
forms a dense set, meaning that the input contains at least one rooted triplet for 
each cardinality three subset of the objects being studied, and that the under- 
lying phylogenetic network is a level-1 network, meaning that each biconnected 
component in the undirected version of the network contains at most one node 
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which has two parents in the directed version of the network. The biological 
significance of level-1 phylogenetic networks, there referred to as galled-trees, 
is discussed in [8]. The rationale for assuming the input to consist of rooted 
triplets is that although computationally expensive methods for constructing 
reliable phylogenetic trees such as maximum likelihood are infeasible for large 
sets of objects, they can be applied to infer highly accurate trees for smaller, 
overlapping subsets of the objects (see, e.g., [3]). One may thus apply maximum 
likelihood to each cardinality three subset L' of the objects and then select the 
most likely rooted triplet for L' to get a dense input set^. Moreover, in some 
applications, the data obtained experimentally may already have the form of 
rooted triplets; for example, Sibley-Ahlquist-style DNA-DNA hybridization ex- 
periments (see [15]) can yield rooted triplets directly. 



1.1 Definitions 

A rooted triplet is a binary, rooted, unordered tree with three distinctly labeled 
leaves. The unique rooted triplet on leaf set {x, y, z} in which the lowest common 
ancestor of x and y is a proper descendant of the lowest common ancestor of x 
and z (or equivalently, where the lowest common ancestor of x and y is a proper 
descendant of the lowest common ancestor of y and z) is denoted by ({a;, y}, z). 
A set T of rooted triplets is called dense if for each {x, y, z} C L, where L is the 
set of all leaves occurring in T, at least one of the three possible rooted triplets 
({a;, y}, z), ({x, z}, y), and ({y, z}, x) belongs to T. 

A phylogenetic network is a connected, rooted, simple, directed acyclic graph 
in which: (1) each node has outdegree at most 2; (2) each node has indegree 1 
or 2, except the root node which has indegree 0; (3) no node has both indegree 1 
and outdegree 1; and (4) all nodes with outdegree 0 are labeled by elements from 
a finite set L in such a way that no two nodes are assigned the same label. From 
here on, nodes of outdegree 0 are referred to as leaves and identified with their 
corresponding elements in L. We refer to nodes with indegree 2 as hybrid nodes. 
For any phylogenetic network N, let IA{N) be the undirected graph obtained 
from N by replacing each directed edge by an undirected edge. N \s a, level- f 
phylogenetic network if every biconnected component in U{N) contains at most 
/ nodes that are hybrid nodes in N . Note that if / = 0 then N is a tree. 

We denote the set of leaves in a rooted triplet t or a phylogenetic network N 
by A{t) or A{N), respectively. A rooted triplet t is consistent with the phyloge- 
netic network A^ if t is an induced subgraph of N. See Fig. 1 for an example. A 
set T of rooted triplets is consistent with N if every ti € T is consistent with N. 

The problem we consider in this paper is: Given a set T = {t\, . . . ,tk} 
of rooted triplets, construct a level-1 phylogenetic network N with A{N) = 

^ A similar approach is used in the quartet method paradigm [14, 16] for reconstructing 
unrooted phylogenetic trees: first infer the unrooted topology of each cardinality four 
subset of the leaf set to obtain a complete set of quartets (unrooted, distinctly leaf- 
labeled trees each having four leaves and no nodes of degree two), then combine the 
quartets into an unrooted tree. 
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Fig. 1. Let N be the phylogenetic network on the left. The rooted triplet {{b,c},e) 
shown on the right is consistent with N. Note that ({c, e}, b) is also consistent with N. 



UtiGT A{ti) such that T is consistent with N, if such a network exists; otherwise, 
output null. Throughout this paper, L represents the leaf set Ut-er A{ti) in 
the problem definition above, and we write n = \L\ and k = \T\. Note that 
(3) < < 3 • (g), i.e., k = 0{n^) if the input is dense. 

Finally, for any set T of rooted triplets and L' C L, we define T | L' as the 
subset of T consisting of all rooted triplets ({a;, y}, z) with {a;, y, z} C L' . 



1.2 Related Work 

Aho, Sagiv, Szymanski, and Ullman [1] presented an 0(fcn)-time algorithm for 
determining whether a given set of k rooted triplets on n leaves is consistent 
with some rooted, distinctly leaf-labeled tree (i.e., a level-0 phylogenetic net- 
work), and if so, returning such a tree. Several years later, Henzinger, King, and 
Warnow [10] showed how to implement the algorithm of Aho et al. to run in 
min{0(fcn°'®), 0{k + \ogn)"\ time. G^sieniec, Jansson, Lingas, and Ostlin [6] 

considered a version of the problem where the leaves in the output tree are re- 
quired to comply with a left-to-right leaf ordering given as part of the input. 
Related optimization problems where the objective is to construct a rooted tree 
consistent with the maximum number of rooted triplets in the input or to find 
a maximum cardinality subset L' of L such that T j L' is consistent with some 
tree have been studied in [2,6,7,12] and [13], respectively. 

The analog of the problem considered by Aho et al. for unrooted trees is NP- 
hard, even if all of the input trees are quartets [19]. Fortunately, certain useful 
optimization problems involving quartets can be approximated efficiently [14, 
16]. For a survey on quartet-based methods for inferring unrooted phylogenetic 
trees and related computational complexity results, see [16]. 

Nakhleh, Warnow, and Linder [17] gave an algorithm for reconstructing a 
level- 1 phylogenetic network from two distinctly leaf-labeled, binary, rooted, un- 
ordered trees with identical leaf sets. It runs in time which is polynomial in the 
number of leaves and the number of hybrid nodes in the underlying phylogenetic 
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network. They also considered the case where the two input trees may contain 
errors but where only one hybrid node is allowed. 

We remark that the deterministic algorithm for dynamic graph connectivity 
employed in the algorithm of Henzinger et al. [10] mentioned above can in fact be 
replaced with a more recent one due to Holm, de Lichtenberg, and Thorup [11] 
to yield the following improvement. 

Lemma 1. [13] The algorithm of Aho et al. can he implemented to run in 
mm{0 {k log^ n), 0{k + log n)} time. 

1.3 Our Results and Organization of the Paper 

We observe that if no restriction is placed on the level of the phylogenetic net- 
work, then the problem can be trivially solved using a sorting network-based con- 
struction in Section 2. Next, in Section 3, we present an 0(n^)-time algorithm 
called OneHyhridLeaf for inferring all phylogenetic networks with one hybrid 
node to which exactly one leaf is attached that are consistent with a given dense 
set of rooted triplets. This algorithm is subsequently used in Section 4, where we 
give a more general algorithm called LevelOne for constructing a level-1 phylo- 
genetic network consistent with T (if one exists) in 0{n^) time when T is dense. 

2 Constructing an Unrestricted Phylogenetic Network 

Given a set T of rooted triplets, we can always construct a level-/ phylogenetic 
network N where / is unrestricted such that N is consistent with T. Moreover, 
the construction can be carried out in time which is polynomial in the size of T 
as follows. Let P be any sorting network (see, e.g., [5]) for n elements with 
a polynomial number p of comparator stages. Build a directed acyclic graph Q 
from P with {p+l)-n nodes {Qij | 1 < i < P+1, I < j < n} such that there is a 
directed edge {Qij, Qi+ij) for every I < i < p and 1 < / < n, and two directed 
edges {Qij, Qi+i^k) and {Qi^k, Qi+ij) for every comparator (/, k) at stage z in P 
for 1 < z < p. Then, for 1 < j < n — 1, add the directed edge {Qij, Qij+i). See 
Fig. 2. Finally, distinctly label the nodes {Qp+ij 1 1 < j < n} by P, and for each 
node in Q having indegree 1 and outdegree 1 (if any), contract its outgoing edge 
to obtain N. It is easy to show that for any {a;, y, z} C L, all three of ({a;, y}, z), 
{{x,z},y), and {{y,z},x) are consistent with N. 

3 Constructing All Phylogenetic Networks 
Having One Hybrid Node with One Attached Leaf 

This section presents an algorithm called OneHyhridLeaf for inferring the set 
of all phylogenetic networks having a single hybrid node attached to exactly 
one leaf (and having no other hybrid nodes) which are consistent with a given 
set T of rooted triplets. This algorithm is later used as a subroutine by the main 
algorithm in Section 4. OneHyhridLeaf assumes that its given set T of rooted 
triplets is dense. We first note the following. 
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Fig. 2. The sorting network P on the left yields a directed acyclic graph Q. 



Lemma 2. Let N be any phylogenetic network consistent with a setT of rooted 
triplets with leaf set L such that N has a hybrid node h to which exactly one 
leaf c is attached and N has no other hybrid nodes. If h and c and all their 
incident edges are deleted and then, for every node with outdegree 1 and indegree 
less than 2, its outgoing edge is contracted, then the resulting graph is a binary 
tree consistent with T \{L\ {c}). 

Lemma 3. Let P be a dense set of rooted triplets and let L be the leaf set ofP. 
There is at most one rooted, unordered tree distinctly leaf-labeled by L which is 
consistent with T . Furthermore, if such a tree R exists then it must be binary. 

Proof Suppose there exist two unordered, distinctly leaf-labeled trees R and R' 
consistent with T such that R ^ R' . Then, for some x,y,z € L, ({x, y}, z) is con- 
sistent with R while {{x,z},y) is consistent with R' . Since T' is dense, at least 
one of ({a;, y}, z), ({a;, z}, y), and {{y, z}, x) belongs to T. This yields a contradic- 
tion in all cases because R cannot be consistent with ({x, z}, y) or {{y, z}, x) and 
R' cannot be consistent with {{x,y},z) or {{y,z},x) since R and R' are trees. 

Next, suppose R is not binary. Then R has a node u with degree greater than 
two. Let X, y, and z be leaves from three different subtrees rooted at children of u. 
T is dense, so at least one of ({x, y}, z), ({x, z}, y), and {{y, z}, x) belongs to T. 
But none of these three rooted triplets is consistent with R. Contradiction. □ 

Our algorithm OneHybridLeaf is shown in Fig. 3. It tests every c G L as 
the leaf attached to the hybrid node. For each such candidate c, it first calls a 
procedure BuildTree to obtain a binary tree R which is consistent with all rooted 
triplets in T that do not involve the leaf c, if such a tree exists. (T is dense, so 
the set T I (L \ {c}) is also dense. Thus, Lemma 3 ensures that if R exists then it 
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Algorithm OneHybridLeaf 

Input: A dense set T of rooted triplets with leaf set L. 

Output: The set of all phylogenetic networks which are consistent with T having 
exactly one hybrid node to which there is exactly one leaf attached. 

1 SetAf=0. 

2 for each c £ L do 

2.1 Let R — BuildTree{T \ {L \ {c})). 

2.2 if i? 7 ^: null then 

2.3 for every pair of nodes u and v in R, where u ^ v and v is not the 
root of R do 

2.3.1 Form a phylogenetic network N from R as follows. Create three 
new internal nodes p, q, and h, insert the four edges (p,u), (p,h), 
{q, v), and (g, h), and attach a leaf child labeled by c to h. Replace 
the edge (vo,v) leading to v by the edge (no, q)- If « is not the root 
then replace «’s incoming edge {uo,u) by the edge (uo,p). 

2.3.2 If N is consistent with all rooted triplets in T involving the leaf c 
then let AT = AT LI {N}. 

endfor 

endfor 

3 return AA. 

End OneHybridLeaf 



Fig. 3. Constructing phylogenetic networks with one hybrid node. 



is uniquely determined and binary.) Then, it tries all possible ways to obtain a 
phylogenetic network from R by inserting a hybrid node h attached to the leaf c, 
and keeps all resulting networks which are also consistent with the rest of T. By 
Lemma 2, all valid networks will be found by OneHybridLeaf. 

To implement BuildTree, we use the fast version of the algorithm of Aho et al. 
referred to in Lemma 1. If L contains at least four elements then BuildTree {T\{L\ 
{c})) is the algorithm of Aho et al. applied to T | (L \ {c}) (we may assume it 
returns null if it fails). For the case \L\ = 3, the set T | (L\ {c}) is empty and we 
simply let BuildTree{T \ {L \ {c})) return a tree with the two leaves in L \ {c}. 
Lemma 4. The time complexity of Algorithm OneHybridLeaf is O(n^). 

Proof. Step 2 iterates Steps 2. 1-2.3 n times. In each iteration, Step 2.1 takes 
0{k + n^ logn) time by Lemma 1. The inner for-loop (Step 2.3) considers 0{n^) 
pairs of nodes of R; for each such node pair. Step 2.3.1 takes 0(1) time and 
Step 2.3.2 takes 0{n^) time. In total. Step 2.3 uses 0(n^ • (1 -I- n^)) = O(n^) 
time, so Step 2 takes 0{n-{k+n^ logn-bn^)) time. Furthermore, k =\T\ = O(n^). 
Thus, the total running time of Algorithm OneHybridLeaf is 0(n®). □ 

4 Constructing a Level-1 Phylogenetic Network 

Here, we present an algorithm called LevelOne for inferring a level-1 phylogenetic 
network (if one exists) consistent with a given dense set T of rooted triplets. The 





468 



Jesper Jansson and Wing-Kin Sung 



basic idea of our algorithm is to partition the leaf set of T into disjoint subsets 
which we call SN-sets, run LevelOne recursively to construct a level-1 network 
for each SN-set, and then apply Algorithm OneHyhridLeaf from Section 3 to 
combine the computed networks for the SN-sets into one level- 1 network. 

We first introduce the concept of an SN-set. Let L be the leaf set of T. For 
any X C L, define the set SN{X) recursively as SN{X U {c}) if there exists 
some c G L\X and X\,X2 & X such that ({a;i, c}, X2) G T, and as X otherwise. 
Below, we study some properties of the SN-sets. 

Lemma 5. SN{{x,y}) for any x,y G L is computable in O(n^) time. 

Proof. It x = y then SN{{x,y}) = {a;} can be obtained in 0(1) time. It x =/= y 
then SN{{x, y}) can be computed by calling Algorithm ComputeSN{x, y) shown 
in Fig. 4. Initially, the algorithm sets X = {x} and Z = {y}. Then, while Z 
is nonempty, it selects any z G Z, augments Z with all leaves c not already 
in A U Z such that {{a,c},z) or {{z,c},a) G T for some a G X, and finally 
removes 2 from Z and inserts z into X. To analyze the time complexity of 
Algorithm ComputeSN, observe that one leaf is transferred from Z to A in each 
iteration of the while-loop and that a leaf which has been moved to A can 
never be moved back to Z, so Steps 2. 1-2.3 are iterated at most n — 1 times. 
Inside the while-loop, the algorithm scans O(n^) rooted triplets at most once 
to augment Z. The total running time of ComputeSN is therefore O(n^). □ 



Algorithm ComputeSN 

Input: A dense set T of rooted triplets with leaf set L. Two leaves x,y G L. 

Output: SN{{x,y}). 

1 Set A = {a;} and Z = {y}. 

2 while Z 9 do 

2.1 Let 2 : be any element in Z. 

2.2 for every a G X do 

If there exists some c G L \ (A U Z) such that ({o, c}, z) G T or 
{{z, c},a) G T then Z — Z U {c}. 
endfor 

2.3 Set A = A U {z} and Z = Z \ {z}. 

endwhile 

3 return A. 

End ComputeSN 



Fig. 4. Computing SN{{x,y}). 



Lemma 6. If T is dense then for any A,B C L, SN{A) n SN{B) equals 0, 
SN{A), orSN{B). 

Proof. Suppose on the contrary that Zi,Z2 G SN{A), 22,^3 G SN{B), Z3 ^ 
SN{A), and zi ^ SN{B). Consider the rooted triplet on the three leaves 21, 22, 
and 23. Since T is dense, at least one of the following three cases must occur: 
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— Case 1 : ({22,23}, 21) € T. Then, by definition, 23 G SN{A). 

— Case 2 : ({21,231,22) € T. Then, by definition, 23 G SN{A). 

— Case 3 : ({21, 22}, 23) G T. Then, by definition, 21 G SN{B). 

In each of the three cases, we have a contradiction. Thus, the lemma follows. □ 

In particular. Lemma 6 holds for all subsets of L of cardinality one or two. 
For any X\,X2 G L (possibly with x\ = X2), S'fV({a;i, X2}) is called trivial if 
S' 7 V({a;i, X2}) = L, and S'fV({a;i, X2}) is called maximal if it is nontrivial and not 
a proper subset of any nontrivial SN[{yi, j/2}), where yi,y2 G L. Let SAf be the 
set of all maximal SN-sets of the form S'fV({a;i, X2}), where possibly xi = a;2. 
Since T is dense, SAf forms a partition of the set L by Lemma 6. Furthermore, 
SAf is uniquely determined. Write SAf = {S'A^i, S'A^2, • • ■ , SNq} and introduce 
q new symbols m, 02, • ■ • , Og- (Observe that <? > 2 if |L| > 2 .) We define a 
function / as follows. For every x G L, let f{x) = at if x G SNi. Let T' be the 
set {{{f{x),f{y)},f{z)) : {{x,y},z) G T and f (x) , f (y) J (z) all differ}. 

Lemma 7 . Suppose T is consistent with a level-1 phylogenetic network. If q = 2 
then the tree distinctly leaf-labeled by ai and 02 is consistent with T' . If q > 3 
then there exists a phylogenetic network having a single hybrid node attached to 
exactly one leaf (and having no other hybrid nodes) that is consistent with T' . 

We also have: 

Lemma 8. Suppose T' is consistent with a level-1 phylogenetic network N' with 
leaf set {ai , . . . , a^j. Let N be a level-1 network obtained from N' by replacing 
each ai by a level-1 network Ni with leaf set SNi consistent with T| SNi. Then 
N is consistent with T . 

Proof. Let t be any rooted triplet in T and write t = ({a;, y}, 2). If a; G SNi, y G 
SNj, and 2 G SNk, where i,j, k all differ, then t is consistent with N (otherwise, 
= ({/( 3 ^)> /(y)|j /(- 2 ^)) = cannot be consistent with N' which is 

a contradiction since t' G T'). If a;,y G SNi and 2 G SNj with i j then t is 
consistent with N by the construction of N. The case x,z G SNi and y G SNj 
(or symmetrically, y,z G SNi and x G SNj) with i yf j is not possible since 
x,z G SNi implies y G SNi. If x, y, z belong to the same SNi then t is consistent 
with Ni and therefore with N. In all cases, t is consistent with N. □ 

Our main algorithm LevelOne is listed in Fig. 5 . Its correctness follows from 
Lemmas 7 and 8. 

Theorem 1 . When T is dense, we can determine if there exists a level-1 phy- 
logenetic network consistent with T , and if so construct one, in 0(n®) time. 

Proof. Apply Algorithm LevelOne to T. For any L' C L, let g{L') be the running 
time of LevelOne{T \ L'). In Step I of the algorithm, we compute SN{{xi,X2}) 
for the pairs (a;i,a;2) in Lx L. By Lemma 5 , Step 1 takes 0{n^) time. Step 2 
can be performed in OfrA) time, and Step 3 takes '^sn-gSM time. Step 5 

can be done in 0{n^) time according to Lemma 4 . In total, we have g{L) = 
J^SN-eSJG + O(n^). Since all sets in SAf are disjoint, g{L) = 0 (n®). □ 
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Algorithm LevelOne 

Input: A dense set T of rooted triplets with leaf set L. 

Output: A level-1 network N consistent with T, if one exists; otherwise, null. 

1 for every xi £ L and X 2 £ L (including xi = X2) do 

Compute SN{{xi,X 2 }. 

endfor 

2 Let 5A/” = {SNi, SN 2 , . . . , SNq} be the set of all maximal SA^xi, 0 : 2 }). 

3 for every SNi £ SAf do 

If \SNi\ > 3 then Ni = LevelOne{T \ SNi)-, else, let Ni be a tree distinctly 
leaf-labeled by SNi. 

endfor 

4 If Ni for any i £ {1, . . . , q} equals null then return null. 

5 If q = 2 then let A be a network with a root node connected to Ni and N 2 . 
Otherwise (q > 3), build T' from T, compute N = OneHybridLeaf {T'), and 
check if N is empty; if yes then let N = null, else select any N' £ N and form 
a network N by replacing each at in N' with Ni. 

6 return N. 

End LevelOne 



Fig. 5. Constructing a level-1 phylogenetic network. 



Algorithm LevelOne can be modified to return all level-1 phylogenetic net- 
works consistent with T by utilizing all the possible topologies returned by One- 
HybridLeaf. However, the running time may then become exponential since some 
inputs are consistent with an exponential number of different level-1 networks. 
(At each recursion level, although the partition of the leaves into SAf is unique 
when the input is dense, there may be more than one way to merge the recur- 
sively computed subnetworks for the SN -sets into a valid network.) 

5 Conclusion 

This paper presents a polynomial-time algorithm for inferring a level-1 phylo- 
genetic network from a dense set of rooted triplets. This problem is not only 
interesting from a combinatorial point of view, but also biologically sound since 
rooted triplets can be obtained accurately by using maximum likelihood or di- 
rectly through experiments. In the future, we plan to further improve the time 
complexity of our main algorithm and to investigate the computational com- 
plexity of the problem when T is not dense. Also, we would like to know if it is 
possible to construct a level-/ phylogenetic network from a dense set of rooted 
triplets in polynomial time for any constant / > 1. 
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